Journal of Applied Psychology 


John G. Darley, Editor 
University of Minnesota 





Table of Contents 


Guttman-Type Scales for Union and Management Attitudes Toward Each Other: 
R. Stagner, W. E. Chalmers, and M. Derber 


The Attitudes of West Texas College Students Toward School Integration: H. 
Greenberg and Dolores Hutto 


Anxiety Level and Score on a Biographical Inventory: D. H. Kausler and E. P. Trapp 
Development of a Criterion of Research Productivity: R. E. Stoltz 


Performance as a Function of Control-Display Relations, Positions of the Operator, 
and Locations of the Control: M. Humphries 


The Postwar Occupational Adjustment of Emotionally Disturbed Soldiers: J. B. 
Miner and J. K. Anderson 


Adequacy of the Residual Sensory Cues for Psychomotor Performance of Arm Ampu- 
tees: Hilde Groth and J. Lyman 


Curriculum Assessment with Critical Incidents: A. S. Glickman and T. R. Vallance 
A Personality Inventory Employing Occupational Titles: J. L. Holland 

Length of Work Periods in Visual Research: M. A. Tinker 

Communication Restraints, Group Flexibility, and Group Confidence: R. C. Ziller 


The Comparative Effectiveness of Some Psychological and Physiological Measures 
in Ranking the Impact of Diverse Environmental Conditions: B. J. Fine 





American Psychological Association 


Volume 42, Number 5 October, 1958 








Consulting Editors 


Harold E. Burtt, Ohio State University

Alphonse Chapanis, Johns Hopkins University

Clifford E. Jurgensen, Minneapolis Gas
Company

Laurence S. McGaughran, University of
Houston

Quinn McNemar, Stanford University

Alexander Mintz, City College of New York

Harold F. Rothe, Fairbanks, Morse and
Company

Julian B. Rotter, Ohio State University

Thomas A. Ryan, Cornell University

Donald E. Super, Columbia University

Miles A. Tinker, University of Minnesota

Alfred C. Welch, University of New
Mexico


Arthur C. Hoffman, Managing Editor
Helen Orr, Promotion Manager
Sarah Womack, Editorial Assistant





This journal gives primary consideration to origi- 
nal investigations in any field of applied psychol- 
ogy except clinical and consulting psychology, al- 
though a descriptive or theoretical article may be 
accepted if it represents a special contribution in 
an applied field. Quantitative investigations of in- 
terest or value to psychologists working in the fol- 
lowing broad fields will be considered: vocational 
and educational prognosis, diagnosis, and guidance 
at the secondary and college level; personnel re- 
search in business, industry, and government; bio- 
mechanics; industrial working conditions; research 
on opinion and morale factors; job analysis and 
classification research; market and advertising re- 
search. 


Because of the large number of manuscripts sub- 
mitted, authors should adhere to the rule of 


“brevity consistent with clarity.” The typical 
manuscript should run to approximately 4,000 
words. There is a lag of approximately twelve 
months between receipt and publication of an 
article. Authors may request advanced publica- 
tion if they are prepared to pay the cost of print- 
ing the necessary extra pages. 


Manuscripts should be addressed to the Editor, 
John G. Darley, 408 Johnston Hall, University of 
Minnesota, Minneapolis 14, Minnesota. All manu- 
scripts should be submitted in duplicate. Original 
figures are prepared for publication; duplicate fig- 
ures may be photographic or pencil-drawn copies. 


Manuscripts must conform to the style require- 
ments described in the Publication Manual of the 
American Psychological Association. 





Journal of Applied Psychology 


Published bimonthly by the 
American Psychological Association 
Prince and Lemon Sts., Lancaster, Pa. 
and 1333 Sixteenth Street N.W. 
Washington 6, D. C. 


$8.00 per volume 




© 1958 by the American Psychological Association, Inc. 








Journal of Applied Psychology

Vol. 42, No. 5

October, 1958








Guttman-Type Scales for Union and Management Attitudes Toward 
Each Other ¹


Ross Stagner 


Wayne State University 


W. E. Chalmers and Milton Derber 


University of Illinois 


The importance of “the attitudes of the 
parties toward each other” has long been recognized
in industrial relations. Many writers
have used the concept, with and without ref- 
erence to specific indices of attitude. How- 
ever, so far as we know, no one has done any 
systematic work on the preparation of scales 
for the quantitative measurement of this vari- 
able. Such scales as have been published 
(e.g., in the Illini City studies [4]) have been 
arbitrarily put together and have not been 
subjected to scaling analysis. Many of these 
scales have included specific items referring to 
local situations, hence were not useful in com- 
parative studies across establishments. 

For the above reasons, it seemed desirable 
to construct generalized scales which could be 
presented to management and union officers 
for the purpose of obtaining a quantified 
statement of attitude. The substantive con- 
text within which the scales were to be used 
is reported in other publications (see, e.g., 
3). The present report is therefore limited 
to the technical characteristics of the scales 
themselves. 


¹ We should like to express our appreciation to 
Milton Edelman of Southern Illinois University and 
to former graduate assistants, Herbert Schaeffer, 
Robert Ver Nooy, Robert Mitchell, Sheldon Luskin, 
and John Tipton, who aided in various phases of 
the collection and analysis of the data. 

² It should perhaps be noted that the items used 
in the Illini City studies with rank-and-file em- 
ployees were analyzed by Williams (5) who found 
that both the attitude-to-company and attitude-to- 
union items formed acceptable scales. However, 
these were not used with union or company officials. 




Selection of scale items. Since it was our 
desire to prepare a generalized scale covering 
the major elements which might be involved 
in the “attitude of management toward the 
union” or in the “attitude of union toward 
management,” we selected items from previ- 
ous scales, and from discussions in the litera- 
ture, which seemed likely to have such gen- 
eralized applicability. Thus, in the scale for 
unions, items were included about management
policies, about foremen, middle and top 
management, about managerial use of power, 
and so on. Parallel questions were framed to 
be presented to executives for attitude toward 
the union. 

Respondents. The scales presented here 
are apparently unique also in another respect. 
They are based upon pooled answers from 
two respondents, instead of treating each re- 
spondent as a separate member of the popu- 
lation. This was an important component of 
our research design. Data regarding each es- 
tablishment (with few exceptions) were based 
on the pooled answers of the two top union 
officials, those who bargained directly with 
the company, and on the pooled answers of 
the two top executives who handled union re- 
lations for management. This procedure was 
followed to increase reliability and to decrease 
the role of subjective biases in the perception 
of situations.³ Specifically, in the case of the
attitudinal material, we wanted a figure which
represented “the position of management,”
not a single person’s attitude, and similarly
for the union.

³ For example, there were numerous disagreements
between our pairs of managerial respondents even on
“factual” issues such as seniority practice. The use
of two informants presumably gave us more “objective”
data than reliance on a single person, especially
since we reinterviewed on “factual” questions in case
of disagreement. No reinterviews were conducted on
attitudinal items.

The analysis reported here is based upon an
N of 41, meaning that interviews were conducted
in 41 establishments in three downstate
Illinois communities. Actually, 76 executives
were interviewed (in 6 companies,
only one executive could be found who was
actively involved in union relations). Similarly,
the total number of interviews was 81
for union officials; in all but one case, two
men were interviewed to obtain the union position.
Interviews were conducted in plant
or union offices by our field interviewers.
Each item with its response alternatives was
presented, and the interviewer marked the response
selected. Respondents had no opportunity
to consult with each other regarding
answers to these items.

The population of 41 establishments has
been described elsewhere (1). Suffice it to
say that the range of sizes (hourly paid employees)
was from 73 to 2,100 employees;
some were in manufacturing, others in utilities
and services; among the manufacturers,
both producer and consumer goods were involved;
and a wide variety of unions functioned
as bargaining agents. However, these
41 firms were restricted to cases which were
not branches of larger organizations, and to
cases in which a single union bargained for a
majority of the work force.

Since each item was presented as a four- or
five-alternative multiple-choice question, pooling
was accomplished simply by taking the
average of the positions chosen. Perfect
agreement between the two spokesmen was
not common. For managers, agreement was
found on five items in at least 50% of firms.
For unionists, seven items showed agreement
in at least 50% of the cases. Since 14 items
were used, it is clear that neither “the management
position” nor “the union position” is
clearly established and unequivocally stated
in the typical relationship. However, it should
be noted that differences in response were
usually limited to a single step on the answer
scale.

Averaging the responses of two persons im- 
plies the assumption that the answer steps 
represent equal intervals. This assumption 
has not been tested. Since the final product 
is presumed to be only an ordinal scale, some 
inequality of answer steps would seem not to 
introduce a serious question. 
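
The pooling step described above can be illustrated with a minimal sketch. The answer data are hypothetical, and using the single available answer when only one respondent was interviewed is an assumption made here for illustration; the report does not state how those cases were handled.

```python
def pool(a, b=None):
    """Average two respondents' answer positions.

    Assumption: when only one respondent was interviewed, that single
    answer stands in for the establishment.
    """
    return float(a) if b is None else (a + b) / 2

def agreement_rate(pairs):
    """Share of establishments whose two respondents chose the same alternative."""
    both = [(a, b) for a, b in pairs if b is not None]
    return sum(a == b for a, b in both) / len(both)

# Hypothetical answers to one four-alternative item from five establishments.
item_answers = [(1, 2), (2, 2), (3, 2), (1, None), (4, 4)]

pooled = [pool(a, b) for a, b in item_answers]
print(pooled)                        # [1.5, 2.0, 2.5, 1.0, 4.0]
print(agreement_rate(item_answers))  # 0.5
```

Pooled values fall on half steps (1.5, 2.5, and so on) whenever the two respondents differ by one step, which is the common case reported above.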


Procedures 


The task of this analysis, in generalized 
terms, is to determine the functional unity of 
the specific attitude items, i.e., to test the 
hypothesis that there is a single point on a 
hypothetical continuum which correctly rep- 
resents the position of one party’s attitude 
with respect to the other. This would be in 
contrast to an equally plausible finding, viz., 
that each attitude is truly composed of a 
number of independent components, and that 
the selection of an index number to designate 
one party’s position is merely a case of aver- 
aging positions on specific issues not intrin- 
sically related one to another. Research on 
union-management relations has commonly as- 
sumed that these attitudes are unitary; we 
felt it important to test this assumption. 

Two methods are commonly used for es- 
tablishing the unitary nature of a function 
revealed in a series of discrete responses: 
correlational analysis; and scaling technique. 
For purposes of cross-comparisons, we used 
both. Phi coefficients were computed between 
all pairs of items, to permit determination of 
clusters of related items; and Guttman’s 
scalogram analysis (3) was also applied to 
the data. The following items were used in 
the analysis (the items are ranked as in the 
final scale; marginal numbers indicate posi- 
tion in original questionnaire; numbers in 
parentheses, such as (2.5), indicate the cutting
point between an answer considered favorable
and one considered unfavorable):


Scale for Management Attitude 
Toward Union 


43. Are the union officers effective leaders of their
organization?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor (2.5)

Is the union generally reasonable or not in its
claims?
(1) very reasonable
(2) reasonable most of the time
(3) frequently unreasonable
(4) extremely unreasonable (2.5)

Does the union interfere seriously with how the
company is managed, or does the management
have a reasonably free hand in running the
plant?
(1) union is no problem
(2) it interferes a little but not seriously
(3) it interferes quite often
(4) it seriously interferes with management (2.0)

Are the union officers interested in the welfare
of the rank-and-file workers?
(1) very much so
(2) pretty much
(3) slightly
(4) very little (2.0)

Does the union cooperate with management on
production matters or not?
(1) they are extremely cooperative
(2) they will go along but not positively support
(3) they do not interfere seriously but sometimes
are obstructionist
(4) they restrict production improvements
quite often (2.5)

34. In general, how do you personally feel about
your company’s relations with the union?
(1) very satisfied
(2) moderately satisfied
(3) moderately dissatisfied
(4) very dissatisfied (2.0)

Has the union tended to weaken employee discipline,
or has it cooperated with management on
disciplinary matters?
(1) cooperative and helpful
(2) sometimes helps but not always
(3) sometimes interferes with discipline
(4) has created some serious disciplinary
problems (2.5)

46. Does the union have too much power in your
establishment?
(1) not too much
(2) too much in a few respects
(3) too much in many respects
(4) far too much (1.5)

Does the union have the support of the workers?
(1) most of the workers are strongly behind it
(2) only a few really active people but most
workers go along
(3) not too much feeling either way
(4) a lot of the workers are hostile (1.0)

37. How do you feel about using the union as the
main channel of communication to the workers
on company policies?
(1) strongly favor
(2) moderately favor
(3) moderately oppose
(4) strongly oppose (combined with 41)

41. Are the local union officers skillful bargainers?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor (favorable if 37 + 41 = 3 or less)


The following items did not scale:


Are the international union representatives skillful
bargainers?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor
(5) none are involved

Do the international union officers create any
serious problems or not?
(1) they are generally responsible and helpful
(2) they are more helpful than troublesome
(3) they are more troublesome than helpful
(4) they generally stir up trouble
(5) none are involved

Does the union try to live up to its agreements?
(1) always
(2) usually
(3) frequently does not
(4) rarely


Scale for Union Attitude Toward 
Management 


Are the top management officials effective executives
of the establishment?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor (2.0)

44. What is the top management attitude toward the
union?
(1) strongly favorable
(2) moderately favorable
(3) moderately unfavorable
(4) strongly unfavorable (2.0)

Does the company try to live up to its agreements?
(1) always
(2) usually
(3) frequently does not
(4) rarely (2.0)

46. Does the company abuse its power in this
establishment?
(1) rarely
(2) occasionally
(3) frequently
(4) very often (1.5)

34. In general, how do you personally feel about
your union’s relations with the company?
(1) very satisfied
(2) moderately satisfied
(3) moderately dissatisfied
(4) very dissatisfied (2.0)

Has the management shown any understanding
of your problems as a union officer?
(1) very understanding
(2) understands the union situation pretty well
(3) understanding of union problems is limited
(4) little or no understanding of union problems (2.0)

Has the management tried to undermine the union
position through direct dealings with the
workers, or has it been careful to safeguard the
union position in such contacts?
(1) is always careful not to hurt union
(2) is usually careful not to hurt union
(3) occasionally tries to weaken union
(4) frequently tries to weaken union (2.0)

Is the top management generally reasonable or
not when it comes to discussing union claims?
(1) very reasonable
(2) reasonable most of the time
(3) frequently unreasonable
(4) extremely unreasonable (1.5)

Are the top management officials interested in
the welfare of the workers?
(1) very much so
(2) pretty much
(3) slightly
(4) very little (1.5)


The following items did not scale:


Do the foremen, in general, act toward the union
in the same way as top management?
(1) the foremen are very much easier to get
along with
(2) the foremen are somewhat easier to get
along with
(3) the foremen are about the same as top
management
(4) the foremen are somewhat more difficult
to get along with
(5) the foremen are much more difficult
to get along with

Does middle management, in general, act toward
the union in the same way as top management?
(1) the middle management are very much
easier to get along with
(2) they are somewhat easier to get along with
(3) they are about the same as top management
(4) they are somewhat more difficult to get
along with
(5) they are much more difficult to get along with

Are the top management officials skillful bargainers?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor

(If a multi-plant company with a home office
outside this establishment) Are the company
representatives from the home office skillful
bargainers?
(1) very much so
(2) pretty good
(3) mediocre
(4) very poor
(5) not involved

(If a multi-plant company with a home office
outside this establishment) Do company representatives
from the home office create serious
problems or not?
(1) they are generally responsible and helpful
(2) they are more helpful than troublesome
(3) they are more troublesome than helpful
(4) they generally stir up trouble
(5) not involved


Results 


1. Attitude of management toward the union.
The 14 × 14 matrix gives 91 phi coefficients,
of which 26 are positive and significantly
above zero,⁴ while only 13 are negative,
one of these being significant.

This finding would suggest the existence of 
a single common factor. However, inspection 
of the data indicates that only 7 items are 
closely related; 15 of their 21 intercorrela- 
tions are positive and significant. Of the re- 
maining 70 coefficients, only 11 are positive 
and significant. Hence, the common variable 
appears to be defined by such items as 34 and 
46 (how do you personally feel about com- 
pany’s relations with union, and does union 
have too much power). Next are items 36, 
39, 40 and 47, followed closely by item 38. 

The remaining seven items have few significant
correlations either with the previously
listed seven or among themselves. A decision
based solely on this set of data would
undoubtedly have been to take the seven
items with a majority of the significant positive
coefficients and use them as a scale, rejecting
the remaining seven which show random
interrelationships.

⁴ Since the data were dichotomized to compute phi
coefficients, significance was tested by finding the 5%
value of chi square for df = 1. This value, 3.84,
corresponds to phi of .293. Copies of tables of phi
coefficients may be obtained by writing to M. L.
Derber, University of Illinois, Urbana, Illinois.
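
As a rough illustration of the correlational step, the sketch below computes a phi coefficient from two dichotomized item vectors and tests it against the 5% chi-square value for df = 1. It assumes the uncorrected relation chi square = N × phi²; the authors' exact computation is not given in the report, and the two item vectors are hypothetical.

```python
from itertools import combinations
from math import sqrt

def phi(x, y):
    """Phi coefficient between two dichotomous (0/1) item vectors."""
    a = sum(1 for i, j in zip(x, y) if i == 1 and j == 1)
    b = sum(1 for i, j in zip(x, y) if i == 1 and j == 0)
    c = sum(1 for i, j in zip(x, y) if i == 0 and j == 1)
    d = sum(1 for i, j in zip(x, y) if i == 0 and j == 0)
    denom = sqrt((a + b) * (c + d) * (a + c) * (b + d))
    return (a * d - b * c) / denom if denom else 0.0

def is_significant(phi_value, n, critical_chi2=3.84):
    """Assumes chi square (df = 1) equals n * phi**2, compared with the 5% value."""
    return n * phi_value ** 2 >= critical_chi2

# Fourteen items give 14 * 13 / 2 distinct pairs.
n_pairs = len(list(combinations(range(14), 2)))
print(n_pairs)  # 91

# Hypothetical dichotomized answers (1 = favorable) for two items, eight cases.
x = [1, 1, 1, 1, 0, 0, 0, 0]
y = [1, 1, 1, 0, 0, 0, 0, 1]
r = phi(x, y)
print(r)                     # 0.5
print(is_significant(r, 8))  # False: 8 * 0.25 = 2.0 is below 3.84
```

The same phi of .5 would pass the test with 41 cases, which shows how strongly the verdict depends on the number of establishments.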

The second analysis was based on the Gutt- 
man procedure. An arbitrary score was com- 
puted for each establishment by summing the 
values of the answers to the 14 items; and the 
establishments were ranked in order of favor- 
ableness toward the union. Responses to the 
14 items were then tallied in 14 columns, with 
the item showing the largest number of favor- 
able opinions at the left, and decreasing to- 
ward the right. Cutting scores were then 
sought such that a maximum number of fa- 
vorable responses to a given item would be in 
the upper portion of the column, and unfavor- 
able responses concentrated in the lower por- 
tion. By dropping items, recomputing scores, 
reranking establishments, etc., it was possible 
to minimize the number of “errors” —unfavor- 
able responses above this arbitrary line, fa- 
vorable responses below it. 

In the course of this scaling, Items 42 and 
45 were dropped first. Item 42 had only one 
significant positive phi coefficient, and 45 had 
the single significant negative phi, with no 
significant positives. The results of the two 
methods thus far are closely comparable. In 
a later phase of scaling Item 47 was also 
dropped. On phi analysis 47 had a large 
number of significant coefficients (six). We 
cannot explain why it proved impossible to 
find a suitable cutting point on 47 which 
would relate it suitably to the revised total 
score. Finally, Items 37 and 41 were kept in 
the scale by combining them; i.e., the choice 
of answers on the two items had to add to 3 
or less to be counted as favorable. Other- 
wise, the combined response was treated as 
unfavorable. Thus, the Guttman scaling pro- 
cedure gives us a scale using 10 items, six 
being in the list of seven having high intercorrelations.
Two items, 35 and 43, were
found to scale adequately despite a modest
number of significant correlations, and two
others, 37 and 41, were saved by combining
them to make one scoring unit, a procedure
which is feasible with Guttman scaling but
difficult to achieve by straight correlational
methods.⁶

⁵ For precise details of the Guttman method, see (3).

Guttman determines whether he has a “true 
scale” by computing the “coefficient of repro- 
ducibility.”” This is simply the number of re- 
sponses correctly located as favorable or un- 
favorable (see above) divided by the total 
number of responses. For the final manage- 
ment scale, this coefficient is .915, indicating 
the presence of a “true scale” in this case. 
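
The coefficient of reproducibility can be sketched as follows. This is one simple reading of the definition above, not the authors' worked procedure: establishments are ranked by total favorable answers, each item gets the cutting line that minimizes its errors, and the data matrices are hypothetical.

```python
def reproducibility(matrix):
    """Coefficient of reproducibility for a 0/1 matrix.

    Rows are establishments, columns are items, 1 = favorable answer.
    """
    rows = sorted(matrix, key=sum, reverse=True)  # most favorable first
    n_rows, n_items = len(rows), len(rows[0])
    errors = 0
    for j in range(n_items):
        col = [row[j] for row in rows]
        # Try every cutting line k: rows above it are expected to be
        # favorable (1), rows below it unfavorable (0).
        errors += min(
            col[:k].count(0) + col[k:].count(1)
            for k in range(n_rows + 1)
        )
    return 1 - errors / (n_rows * n_items)

# Hypothetical data: a perfect scale, and one with two misplaced answers.
perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
flawed = [[1, 1, 1], [1, 1, 0], [0, 1, 1], [1, 0, 0], [0, 0, 0]]

print(reproducibility(perfect))           # 1.0
print(round(reproducibility(flawed), 3))  # 0.867
```

In the second matrix two of the fifteen answers fall on the wrong side of their item's cutting line, so the coefficient is 13/15.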

Each item in a Guttman scale is scored as 
having a weight of one point. Thus, there is 
a possible range of scores from zero to nine. 
These are the scores which have been used in 
the substantive analysis of our data. 
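
A minimal sketch of the scoring rule follows, using three of the management items and the combined 37 + 41 unit. The cutting points are copied from the item list above; treating a pooled answer at or below the cutting point as favorable is an assumption, suggested by the cutting point of 1.0 on one item and by the "3 or less" rule for the combined item. The sample establishment is hypothetical.

```python
# Cutting points taken from the management item list (subset only).
CUTS = {"43": 2.5, "34": 2.0, "46": 1.5}

# Items 37 and 41 form one scoring unit: favorable if the two pooled
# answers add to 3 or less.
COMBINED = ("37", "41")

def scale_score(pooled):
    """Count favorable scoring units for one establishment, one point each."""
    # Assumption: a pooled answer at or below the cutting point is favorable.
    score = sum(1 for item, cut in CUTS.items() if pooled[item] <= cut)
    if pooled[COMBINED[0]] + pooled[COMBINED[1]] <= 3:
        score += 1
    return score

# Hypothetical pooled answers for one establishment.
firm = {"43": 2.0, "34": 2.5, "46": 1.0, "37": 1.5, "41": 1.5}
print(scale_score(firm))  # 3
```

Here items 43 and 46 and the combined unit count as favorable, while item 34 does not.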

2. Union scale. We may now consider the 
parallel scaling of the responses obtained from 
union respondents. It will be noted that the 
items presented to the union spokesmen were 
in general similar to those for management, 
but certain variations were necessary. 

The phi coefficient analysis for the 14 un- 
ion items indicates that the degree of internal 
coherence of attitudes here is somewhat less 
than for the company officials; only 20 co- 
efficients (out of 91) are positive and signifi- 
cant at the 5% level; again, one is negative 
and significant to this degree. 

The common thread in the union items 
seems best represented by Item 44 (what is 
top management attitude to union) and Item 
46 (does company abuse power), which have 
significant correlations with six other items. 
Close behind are 35, 36, and 40 (is top man- 
agement reasonable, has management shown 
understanding, top management interested in 
worker welfare), each of these having five sig- 
nificant correlations. Reasonably integrated 
with the others is Item 34 (how do you per- 
sonally feel about union relations with com- 
pany). One item, 38, had no significant co- 
efficients, and Items 41 and 42 had only one 
each. It would appear that a satisfactory
scale could be prepared with Items 44, 46, 35,
36, 40, and 34, since 14 of the 20 significant
positive coefficients are found within this
group; only one of their intercorrelations was
below the 5% cutting point.

⁶ Guttman defends the procedure of contriving
these combined items on the ground that a pattern
of responses may provide information not given by
an isolated response.

Turning now to the Guttman analysis, we 
find a situation similar to that in the manage- 
ment data. Four items were dropped at the 
first stage of the analysis: 38, 39, 42, and 45; 
collectively, these four items accounted for 
only three of the 20 significant positive coeffi- 
cients, and for the single negative coefficient 
which reached significance. It would appear, 
therefore, that items which correlate very 
poorly with others in the pool are fairly cer- 
tain to drop out early in a Guttman analysis. 

After rescoring and reranking the establish- 
ments, we found it necessary to drop one 
more item, 41; this item likewise has only a 
single significant correlation. The remaining 
nine items were shown to form a “true scale”; 
the coefficient of reproducibility was .902. It 
is interesting to note that the item with the 
fewest “errors” in the final scaling is 44, 
which was also best in the correlational data; 
but, on the other hand, Item 46, almost as 
good in the correlation matrix, has almost the 
highest number of errors in the Guttman pro- 
cedure. Since similar anomalies were observ- 
able in the management data, we infer that a 
core of items established by intercorrelations 
will agree fairly well with those selected in 
the early stages of the Guttman method; but 
as further precision is sought, the correla- 
tional results do not predict well the final 
scaling of items. 

Guttman considers his scales to be truly 
“ordinal” in character, and instead of speak- 
ing of scores he speaks of “scale types.” How- 
ever, he notes that it does little harm to the 
data to treat everyone making the same nu- 
merical score as belonging to the same scale 
type, even though the pattern of favorable 
and unfavorable answers may not be exactly 
identical for all. We have followed this sug- 
gestion for simplicity, and thus have treated 
all establishments having m favorable answers 
as belonging to scale type m. Since the num- 
ber of errors is small, the number of cases 
which would be reclassified by a different pro- 
cedure is very small. 




Table 1
Distribution of “Scale Types” (a)

Columns: Union to Management; Management to Union.
[The frequency entries of this table are illegible in the source.]

(a) We have treated the scale type simply as the sum of favorable
responses, and statistically have considered it as the
equivalent of a test score.

(b) The maximum for the union scale is 9.


The final distribution of scale types is 
shown in Table 1 for both scales. It will be 
noted that the entire attitudinal range is rep- 
resented in our population of establishments 
on each scale; however, there is not a uni- 
form spread along the scale as Guttman would 
consider desirable. The piling up is in the 
“more favorable” categories; this is presum- 
ably fortunate in terms of labor peace. 

An additional technical note is important. 
Ideally, in Guttman scaling, the marginal en- 
tries should be widely and fairly uniformly 
spread; i.e., there should be a range from an
item answered favorably by almost everyone 
to one which is answered unfavorably by most 
of the population, and items well spaced be- 
tween these two. Our data do not meet this 
test; despite our choices of cutting-points, the 
items also pile up in the range with large 
numbers of favorable answers. This situa- 
tion could not be corrected without seriously 
increasing the number of “errors.” The er- 
rors remaining in the final scales were ran- 
domly distributed. 

3. “Attitudinal climate.” In our Illini City 
study, the data indicated that a common “atti- 
tudinal climate” could be identified for an es- 
tablishment. This conclusion was based pri- 
marily on the responses of rank-and-file work- 
ers (who were also union members), indicating 
that liking for the company was associated 





Scales for Union and Management Attitudes 


with liking the union, and vice versa. In a 
plant where the workers liked management, 
they seemed also favorable to the union; in 
firms where the management was disliked, the 
union also was in relative disfavor.⁷ However,
this relationship did not hold when we 
compared the attitudes of union and manage- 
ment leaders within the establishment. 

In our present study, it is possible to make 
another test of the “attitudinal climate” hy- 
pothesis. We have used reasonably parallel 
questions, asked of managers and of unionists, 
to define the attitude of each toward the 
other. If attitudinal climate generalizes to all 
levels of an establishment, we would expect 
that firms in which management approves of 
the union would also have a union which ap- 
proves of management; i.e., there should be 
a significant positive correlation between the 
two measured attitudes. 

This prediction is not confirmed. The cor- 
relation between the two attitudes, for our 41 
establishments, using the a priori scales based 
on the original 14 items in each questionnaire, 
was .271. This verged on significance, and it
was thought that the refined scales would correlate
more highly. Just the reverse proved
to be the case. The final scales correlated 
only .130 with each other, a result which 
would indicate that there is no connection, 
except perhaps in very extreme cases, between
the two attitudes.⁸ We are therefore 
disposed to conclude that, while attitudinal 
climate may be a functional reality at the 
level of the rank and file, the relationship at 
the level of the leaders is too complex to be 
incorporated in a simple formulation of this 
kind. 
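
The remark that .271 "verged on significance" can be checked with a short calculation. The sketch assumes the reported values are product-moment correlations tested two-tailed with N − 2 = 39 degrees of freedom, which the report does not state; the critical value 2.023 is the standard table figure for that case.

```python
from math import sqrt

def t_from_r(r, n):
    """t statistic for testing a correlation r from n cases against zero."""
    return r * sqrt(n - 2) / sqrt(1 - r * r)

# Two-tailed 5% critical value of t for df = 39 (standard table value).
T_CRIT = 2.023

for label, r in [("a priori scales", 0.271), ("refined scales", 0.130)]:
    t = t_from_r(r, 41)
    print(f"{label}: r = {r:.3f}, t = {t:.2f}, significant = {abs(t) >= T_CRIT}")
```

Under these assumptions neither correlation reaches the 5% level, and the refined scales fall well short of it.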


⁷ Compare the recent study by Form and Dansereau
(2), in which it was shown that active union 
members like their jobs and their departments (“as 
a place to work”) more than inactive members. 

⁸ A substantive analysis indicates that there may 
be a variety of reasons for management to report 
hostility to the union, and similarly on the part of 
the union. Certain combinations of these factors 
seem to give rise to “types” or patterned relation- 
ships which are repeated several times in our sample 
of 41 establishments. It should be noted that, in 
constructing these “types,” we did not assume either 
a high correlation of management and union atti- 
tudes or the legitimacy of a combined attitude score. 
Types were constructed by choosing a cutting point 
on each scale separately and treating as a single type 
those establishments which had certain combinations 
of high or low scores (see [1] for details). 


Discussion 


This report deals with a portion of the data 
from an extensive quantitative study of union-management
relations at the local level 
(1). A Guttman analysis (supported by cor- 
relational data) indicates that unidimensional 
scales have been established both for manage- 
ment attitudes to the union and for union 
attitudes toward management. 

The scales developed are strongly character- 
ized by what may be called a “halo” factor, 
or a generalized tendency to approve or dis- 
approve. The better items in the manage- 
ment scale were such as, “How do you per- 
sonally feel about the company’s relations 
with the union?” “Does the union have too 
much power?” However, this does not mean 
the respondent approved (or disapproved) of 
everything associated with the union. Highly 
specific items failed to scale; e.g., “Are the international
union representatives skillful bargainers?”
The same tendency (for generalized
items to scale, and for specific items not
to scale) occurs with the union attitudes. It
would appear that a Guttman scale may be
simply an elaborate and highly reliable way
to ask, “Do you approve of the union?”

Such scales can, of course, be of consider- 
able value in research which involves quantita- 
tive estimates of union or management atti- 
tudes. It is to be hoped that the scales here 
reported will be seen by other students of in- 
dustrial relations as valuable devices, to be 
incorporated in their own research designs. 
This seems to be a path by which progress in 
this field can be greatly accelerated. 


Summary 


1. Fourteen multiple-choice items, relating 
to the attitude of management toward the un- 
ion, were presented to the two top labor-rela- 
tions officials of 41 Illinois companies. Cor- 
relational analysis indicated the presence of 
a single factor, or unitary attitudinal dimension,
involving at least seven items. Guttman
analysis confirmed this, and produced a “true 
scale” consisting of nine of the original items 
plus one contrived item combining two ques- 
tions. 

2. A parallel analysis of responses from the 
union officials in these same establishments 







led to similar conclusions and to a Guttman 
scale of nine of the original items. 

3. Attitude of management toward union is 
uncorrelated with attitude of union toward 
management, when measured by these refined 
scales. 

4. The refined scales are valuable instru- 
ments for substantive research on union-man- 
agement relations at the plant level. 


Received January 17, 1958. 


References 


1. Derber, M., Chalmers, W. E., & Stagner, R. Uniformities
   and differences in local union-management
   relationships. Indus. Labor Rel. Rev.,
   1957, 11 (Oct.), 56-71.

2. Form, W. H., & Dansereau, H. K. Union member
   orientation and patterns of social integration.
   Indus. Labor Rel. Rev., 1957, 11 (Oct.),
   3-12.

3. Guttman, L. The basis for scalogram analysis.
   In: S. A. Stouffer et al., Measurement and
   prediction. Princeton, N. J.: Princeton Univer.
   Press, 1950. Pp. 60-90.

4. Institute of Labor and Industrial Relations, University
   of Illinois. Labor-management relations
   in Illini City. Vol. 1, Case studies. Vol.
   2, Exploration in comparative analysis. Champaign,
   Ill.: Author, 1953, 1954.

5. Williams, L. K. Scaling of interview data on two
   union-management relationships. M.A. thesis,
   Univer. Illinois, 1954.





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


The Attitudes of West Texas College Students Toward School 
Integration 


Herbert Greenberg and Dolores Hutto 
Texas Technological College 


With the recent Supreme Court decision 
making segregation in schools unconstitu- 
tional, interest is high in the question of school 
integration. Much has been said about the 
consequences of integration if effected too 
rapidly, and on the other side, the pressure 
has been great to desegregate just as quickly 
as the physical change could be effected. Few 
attempts have been made, however, to scien- 
tifically determine the attitudes of the stu- 
dents themselves concerning integration. Thus, 
the senior author and his associates conducted 
a study (2) of Negro and white high school 
students in the West Texas area. A logical 
next step it seemed was to conduct a similar 
study on the college level. This seemed espe- 
cially appropriate after the incidents in Ala- 
bama and the contrary smooth integration at 
the University of North Carolina. 


Problem 


1. To determine the attitudes of students in 
a large West Texas college toward school in- 
tegration in general and to specific areas of 
this integration 

2. To determine whether there is a relationship
between authoritarianism and attitudes
toward school integration

3. To determine whether classification in 
school is related to authoritarianism or to 
integration attitudes 


Procedures 


Population. A battery of tests was administered 
to 460 students of Texas Technological College of 
Lubbock, Texas. This number represents approxi- 
mately 7 per cent of the student population and was 
selected on the basis of an exact breakdown as to 
division, i.e., engineering, liberal arts, etc.; classifica- 
tion; and sex. The tests were administered under 
the supervision of the senior author by administrators 
to seven elementary psychology classes. The personal 
data were then checked to determine the divisional, 
classification, and sex breakdown obtained from the
sample. Where the test sample deviated from correct
representation of the universe, i.e., student population
of Texas Tech, cases were eliminated from the overrepresented
groups till the tested population did accurately
represent the universe. This could successfully
be done for sex and divisional breakdown, but
would have required an overly severe paring in order
to create a proportionately representative group, from
the senior and graduate classes, as comparatively few 
of these students could be found in elementary psy- 
chology. To make up this deficiency, students were 
approached at random in the cafeteria and asked 
their classification and division, and those students 
who fitted the need were asked if they would co- 
operate by filling out a short questionnaire. Agree- 
ment was received in every case, so that the problem 
of selective sampling need not be considered. Through 
this method, the exact categorical breakdown dis- 
cussed above could be achieved while allowing the 
bulk of the sampling to be derived from group test- 
ing in the seven psychology classes. 
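The paring step described above can be sketched as follows. This is only an illustration under stated assumptions: the strata, the counts, and the population shares are hypothetical, and the function name is ours; the article does not report the figures actually used. Cases are dropped at random from the over-represented strata until every stratum matches its population share.

```python
import random


def trim_to_proportions(sample, population_share, seed=0):
    """Drop cases at random from over-represented strata.

    sample: dict mapping stratum -> list of case ids (the tested students).
    population_share: dict mapping stratum -> share of the population.
    Returns the largest subsample whose strata match those shares.
    """
    rng = random.Random(seed)
    # The scarcest stratum, relative to its population share, caps the total.
    total = min(len(cases) / population_share[s] for s, cases in sample.items())
    trimmed = {}
    for stratum, cases in sample.items():
        # Small tolerance guards against floating-point shortfall (e.g. 9.999...).
        keep = int(total * population_share[stratum] + 1e-9)
        trimmed[stratum] = rng.sample(cases, keep)
    return trimmed


# Hypothetical counts: seniors are scarce in the tested classes.
sample = {
    "freshman": list(range(60)),
    "sophomore": list(range(60, 100)),
    "senior": list(range(100, 110)),
}
population_share = {"freshman": 0.4, "sophomore": 0.3, "senior": 0.3}
trimmed = trim_to_proportions(sample, population_share)
print({stratum: len(cases) for stratum, cases in trimmed.items()})
```

With these invented figures only 33 of 110 cases survive, which shows why paring alone was too severe for the senior and graduate classes and why additional students had to be recruited instead.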

The test battery. The California F Scale, the In- 
tegration Attitude Scale and a personal data form 
were employed. The California F Scale (1) was 
designed to measure authoritarianism in personality 
through a series of 29 statements to which the re- 
spondent reacts by strongly agreeing, agreeing, mildly 
agreeing, strongly disagreeing, and mildly disagreeing. 
An example item is “Familiarity breeds contempt.” 

The Integration Attitude Scale (2), “IA,” was de- 
signed to measure attitudes toward school integration 
in general and specific areas in integration in par- 
ticular. Thus, it is the purpose of this instrument 
not merely to obtain over-all integration attitudes, 
but to determine integration attitudes as applied to 
sports, classrooms, eating facilities, etc. For the 
purposes of this present study, question Number 24, 
“I do not think I would be willing to sit next to a 
member of another race in class,” was omitted and 
was replaced by, “I would not accept a member of 
the other race as a roommate in the dormitory.” 

The personal data form was designed to secure 
personal-social data so as to permit the establishment 
of comparison groups between the classification, 
sexes, and divisions leaving integration attitudes and 
authoritarianism as variables. 

Procedures in treating data. Means and standard deviations
were found for each classification, division, and
sex for each scale. Critical ratios were then obtained 
between the means of each of these breakdowns for 
each scale. Thus, freshmen F-scale means were com- 
pared with those of the sophomores, juniors, seniors, 
and graduates while the same procedure was fol- 
lowed with the IA. Likewise, comparisons were 







Table 1
Product-Moment Correlation, IA to F

Classification        Product-Moment Correlation

Freshmen              0.215
Sophomores            0.167
Juniors               0.15
Seniors               0.289
Graduates             0.518
Total Population      0.213





made between males and females for both scales and 
between the several divisions. 
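For readers unfamiliar with the statistic, a critical ratio is the difference between two group means divided by the standard error of that difference. The sketch below is a minimal illustration, not a recomputation of the reported values: the two means are the freshman and senior F-scale means quoted in the Results, while the standard deviations and group sizes are placeholders assumed only for the example.

```python
import math


def critical_ratio(mean1, sd1, n1, mean2, sd2, n2):
    """Difference between two independent means divided by its standard error."""
    standard_error = math.sqrt(sd1**2 / n1 + sd2**2 / n2)
    return (mean1 - mean2) / standard_error


# Means as quoted in the text; SDs (20.0, 22.0) and group sizes (150, 70)
# are hypothetical placeholders, not figures from the article.
cr = critical_ratio(108.83, 20.0, 150, 103.70, 22.0, 70)
print(round(cr, 2))
```

The ratio is referred to the normal curve, so its size depends heavily on the group sizes as well as on the difference between the means.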

Lastly, an item analysis of the IA Scale was con- 
ducted in which the number of positive and negative 
responses for each item was determined for each 
classification, division, sex, as well as for the total 
population. This was done to ascertain group at- 
titudes in specific areas of integration. 


Results 


Table 1 shows product-moment correlations 
for the classifications and for the total popula- 
tion IA to F. It can be seen that there is a 
very low correlation between the two scales, 
the only significant one being the +0.52 for 
the graduates. It should be noted further 
that the interscale correlation computed for 
the several divisions and for each sex yielded 
approximately the same results as for the total 


Table 2

Means and Standard Deviations for the Several
Classifications for Both Scales

Classification    Mean F    SD F      Mean IA   SD IA

Freshmen          108.83    20.008    91.78     41.09
Sophomores        108.59    21.66     94.63     11.27
Juniors           113.26    21.64     93.63     37.82
Seniors           103.7     22.3      98.13     40.58
Graduates         105       17.44     97.39     42.5





population. Thus, they are not presented here 
in separate tables. 

Tables 2 and 3 show means and standard 
deviations for each classification and the criti- 
cal ratios between designated comparison clas- 
sifications. It should be noted that a high 
score designates greater authoritarianism and 
more negative attitudes toward integration. 
Thus, the freshmen mean of 108.83 on the F 
scale makes that group significantly more 
authoritarian than the seniors, having a mean 
of 103.7. Conversely, on the IA scale the 
seniors, with a mean of 98.13, have signifi- 
cantly more negative attitudes toward school 
integration than the freshmen, whose mean is 
the lower score of 91.78. It should be noted 
further that, as in the case of Table 1, there 


Table 3
Critical Ratios Between the Means of the Several Classifications for Both Scales

Classification          C.R. “F”    C.R. “IA”

Freshmen/Sophomores     .29
Sophomores/Juniors      1.66
Juniors/Seniors         2.97
Freshmen/Seniors        1.73
Freshmen/Juniors        1.72
Sophomores/Seniors      1.88

Remaining entries (C.R. “IA” values and significance levels), in source order:
01% level; 05% level; 05% level; 1.92; 05% level; 1.87; 05% level; 01% level; .001% level; 05% level; 01% level










was no significant difference between the 
means of each sex and between the several 
divisional means. Thus, they are not pre- 
sented separately. 

In most areas the student attitude toward 
integration is quite positive. For the most 
part, however, the more advanced classes appear
less positive than the freshmen and sophomores.
It can be further noted that in the
areas of closer and closer personal-social contact
the relative positiveness of the answers
decreases until, in such areas as dating (Item 20,
“I believe that dating between races will be
a serious problem soon after integration,” and
Item 21, “I would not mind ‘double dating’
with a couple both of whom were of the other
race”) and dancing (Item 12, “Different racial
groups mixing at school functions [dances,
parties, etc.] will not be wise—it will only
result in fights and ill feeling between races”),
a majority of the students expressed negative
attitudes.

In many items, on the other hand, students 
in all classifications expressed overwhelmingly 
positive attitudes. For example, in Item 1, 
“If another race was integrated into my school,
I would do my best to accept them as class- 
mates and equals,” over 90 per cent of the 
respondents expressed agreement. Likewise, 
in questions pertaining to integration in class- 
rooms, athletics, and even fraternities, a large 
percentage of all classifications expressed favor- 
able integration attitudes. In 22 out of the 
29 “IA” items, a majority of all classifications 
expressed favorable integration attitudes, while 
in most of the remaining 7 items the several
classifications were divided in attitudes, with
the higher classifications, as indicated above,
generally having the less favorable attitudes.

An interesting result of this study also is 
that the pattern is almost identical to that 
shown by the high school group in our earlier 
study (2). The high school group have con- 
siderably higher authoritarian attitudes than 
the college group and somewhat higher scores 
on the IA scale, but the clear pattern of posi- 
tive attitudes with the decreasing positiveness 
in the close personal-social areas is present for 
both groups. 




Conclusions 


1. There is apparently little direct rela- 
tionship between authoritarianism and nega- 
tive integration attitudes for this population. 
The integration attitudes in the South are ap- 
parently so much a part of the culture that 
they are separated from the more generalized 
attitudes covered by authoritarianism. This 
hypothesis is lent additional weight by the 
fact that only in the graduate group some 
correlation was found. This, of course, is the 
group that through maturity and education 
may have gained insight into the relationship 
of the attitudes. 

2. In general, it may be concluded that 
there is a generally positive attitude toward 
integration on the part of the white college 
student in West Texas, thus easing the widely 
expressed fear of serious trouble when inte- 
gration comes. This conclusion is based on 
two facts. First, the IA Scale means for
all classifications lie well below a potential
middle score of 116. This fact indicates that 
far more than 50% of the responses were 
those positive toward integration. This fact 
alone, however, would not be sufficient to 
justify this conclusion. The additional factor 
of the actual numbers of items to which an 
overwhelmingly positive response, as opposed 
to negative responses, was made adds sub- 
stantial weight to the above statement. It 
should be noted that in 22 out of the 29 IA 
items the positive integration attitudes outnumber
the negative ones in all five classifications,
while in most of the remaining 7 the
classes are divided in their response. Further, 
in many items there is better than a two to 
one ratio of positive to negative response. 

3. Problems may be looked for in the areas 
of close personal-social contact such as socials, 
parties, dances and the like. 

4. There is apparently a discrepancy be- 
tween student and parental attitudes. Thus, 
where about 90 per cent of the students say 
they would do their best to accept members of 
the other race as classmates and equals and 





the majority would accept them into clubs
and even as good friends (Item 15), a definite
majority would not bring these classmates and
even good friends home with them due to fear
of parental disapproval (Item 8).

Received April 8, 1957.


References


1. Adorno, T. W., Frenkel-Brunswik, Else, Levinson,
   D. J., & Sanford, R. N. The authoritarian
   personality. New York: Harper, 1950.

2. Greenberg, H., Chase, A. L., & Cannon, T. M., Jr.
   Attitudes of white and Negro high school students
   in a west Texas town toward school integration.
   J. appl. Psychol., 1957, 41, 27-31.





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Anxiety Level and Score on a Biographical Inventory ¹


Donald H. Kausler and E. Philip Trapp 


University of Arkansas 


Siegel (5, 6) has recently developed a bio- 
graphical inventory, The Biographical Inven- 
tory for Students (BIS), which shows promise 
as a predictor test, particularly in the area of 
academic achievement. He found, for exam- 
ple, that four of the 10 subscales, Action, 
Heterosexual Activities, Political Activities, 
and Dependence upon the Home, correlated 
significantly with college grade point average. 

The relationship between scores on the De- 
pendence upon the Home subscale (Dep) and 
grade point was curvilinear, with the higher 
grades corresponding to the extreme positions 
on the Dep continuum. In a follow-up study, 
Kausler and Little (4), relating Dep scores 
with grades in psychology courses, obtained 
a similar curve. 

Siegel, in discussing the meaning of this 
curvilinear relationship, suggests that the Ss 
in the middle of the Dep range are more in- 
secure than the Ss located at the two ex- 
tremes. This difference in security accounts 
primarily for the difference reflected in aca- 
demic performance. 

The present authors prefer to interpret 
Siegel’s findings in terms of a more clearly 
defined motivational variable. The curvilinear 
relationship suggests the operation of varying 
levels of anxiety drive at varying levels of 
dependency. In other words, the Dep sub- 
scale may, in effect, be measuring a motiva- 
tional source of individual differences. 

This study is directed at investigating the 
suggested relationship between the Dep sub- 
scale and anxiety level. It is hypothesized 
that: (a) the Ss scoring at the two extremes 
on the Dep subscale of the BIS will differ 
significantly in anxiety level from Ss scoring 
in the intermediate range; and (b) the Ss
scoring at the two extremes on the Dep sub- 
scale will reflect lower anxiety level than the 
Ss scoring in the intermediate range. The 


1 The authors wish to thank Laurence Siegel and 
Educational Testing Service for their permission to 
use the BIS in this study. 


hypothesized direction of the differences be- 
tween the two groups follows Farber’s sug- 
gestion (1) that performance level is appar- 
ently inversely related to anxiety drive level 
on complex learning problems. 


Method 


Subjects. Since only the BIS form for men is 
currently available, the Ss were 64 male students, 
predominantly freshmen and sophomores, enrolled in 
general psychology at the University of Arkansas. 

Procedure. The entire BIS was administered with 
standard instructions to the Ss during a regular class 
period, though only the Dep subscale was used in 
this study. The Ss were informed that the objective 
of the inventory was to compare biographical infor- 
mation of students at the University of Arkansas 
with students in other universities. 

Approximately one month later, the Taylor Mani- 
fest Anxiety Scale (7) was administered to the Ss, 
again during a regular class period. The scale, titled 
Biographical Inventory, was described to the Ss as 
an additional source of biographical information for 
comparing students at Arkansas with students elsewhere.
The standard scoring procedures were employed.


Results 


The Ss were divided into two groups on the 
basis of their scores on the Dep subscale. The 
scores ranged from −2 to +20, with a median
of 9. The Ss in the first and fourth
quartiles formed the extreme group (N = 32),
and the Ss in the second and third quartiles 
formed the intermediate group (N = 32). 
The Ss were further classified according to 
their level of “manifest anxiety,” applying 
the method usually followed in studies with 
the Taylor scale. The upper 25%, based on 
the distribution of scores for all 64 Ss, con- 
stituted a high “manifest anxiety” group (HA, 
N = 16), the lower 25% a low “manifest anx- 
iety” group (LA, N = 16), and the middle 
50% a middle “manifest anxiety” group (MA,
N = 32).
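The grouping by quartiles can be sketched as below. This is a minimal sketch under stated assumptions: the scores are invented (they merely span the reported range of −2 to +20), the function name is ours, and ties at the quartile boundaries are broken arbitrarily by subject number, a detail the article does not specify.

```python
def split_extreme_intermediate(scores):
    """Rank-based quartile split: first and fourth quartiles vs. the middle two.

    scores: dict mapping subject id -> Dep score. Ties are broken by subject
    id, which is an arbitrary choice made for this sketch.
    """
    ranked = sorted(scores, key=lambda s: (scores[s], s))
    q = len(ranked) // 4
    if q == 0:
        raise ValueError("need at least four subjects")
    extreme = ranked[:q] + ranked[-q:]
    intermediate = ranked[q:-q]
    return extreme, intermediate


# Hypothetical Dep scores for 64 subjects, ranging from -2 to +20.
scores = {i: (i * 7) % 23 - 2 for i in range(64)}
extreme, intermediate = split_extreme_intermediate(scores)
print(len(extreme), len(intermediate))
```

The same rank-based cut, applied to the Taylor scale scores at the 25% and 75% points, would yield the HA, MA, and LA groups.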




Since both the Taylor scale and the BIS 
are ordinal scales, the hypotheses investigated 
were tested by a nonparametric test, χ². A







Table 1

Dep Score and Anxiety Level: Observed and
Expected Frequencies

            Extreme Dep            Intermediate Dep
         Observed  Expected      Observed  Expected

HA           2         8            14         8
MA                    16                      16
LA

Note.—χ² = 12.36 (p < .01).


3 × 2 contingency table was established comprising
two groups, Extreme Dep (high and
low scores on Dep) and Intermediate Dep 
(intermediate scores on Dep), and three clas- 
sifications, High Anxiety (HA), Middle Anx- 
iety (MA), and Low Anxiety (LA). The re- 
sults of this analysis are presented in Table 1 
below. 

An examination of Table 1 reveals a large 
difference in “manifest anxiety” between the 
Extreme Dep and Intermediate Dep Groups 
and in the hypothesized direction. The Inter- 
mediate Group contains a preponderance of 
HA Ss, while the Extreme Group contains a 
high proportion of MA Ss. The overall χ² is
12.36, which is significant at the .01 level of 
confidence. 
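The computation behind such a test can be sketched as follows. The sketch is illustrative only: the HA counts (2 and 14) follow the legible row of Table 1, whereas the MA and LA counts are placeholders that merely respect the reported group sizes and are not figures from the article. Expected frequencies are derived from the row and column totals in the usual way.

```python
def chi_square(observed):
    """Pearson chi-square and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (count - expected) ** 2 / expected
    df = (len(observed) - 1) * (len(col_totals) - 1)
    return stat, df


# Rows: HA, MA, LA; columns: Extreme Dep, Intermediate Dep.
# The MA and LA rows are hypothetical placeholders, not the article's data.
table = [[2, 14], [21, 11], [9, 7]]
stat, df = chi_square(table)
print(round(stat, 3), df)
```

With 2 degrees of freedom, a value above 9.21 is significant at the .01 level.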


Discussion 


Successful performance on complex tasks, 
such as college courses, is dependent upon 
interactions among a multiplicity of aptitude 
and motivational factors. The efficacy of bio- 
graphical information in predicting perform- 
ance on such tasks, therefore, depends upon 
the extent to which this information measures 
one or more of these factors. The biographi- 
cal information derived from such scales as 
the BIS is interpreted by the authors as tap- 
ping primarily the motivational area. That 
is, certain facets of the individual’s motiva- 
tional system, depending, of course, on the 
content of the biographical data, express 
themselves through his personal history. The 


present study substantiates this interpretation 
in demonstrating a relationship between scores 
on the Dep subscale of the BIS and the mo- 
tivational variable of anxiety as measured by 
the Taylor scale. However, a more rigorous 
interpretation of this relationship in terms of 
drive level is restricted by recent criticisms 
(2, 3) of the Taylor scale as a measure of 
drive.

The results of the present study also ap- 
pear consistent with common clinical evidence 
on manifest anxiety. An individual's anxiety, 
ordinarily, should be more controlled or re- 
duced when competition between his depend- 
ency and independency needs is low rather 
than high. Individuals scoring at the extreme 
positions on the Dep subscale are likely to 
experience relatively low competition between 
their dependency and independency needs be- 
cause of marked subjugation or suppression of 
one need by the other. On the other hand, 
individuals scoring in the middle range, re- 
flecting a closer balance of strength between 
their dependency and independency needs, 
are more likely to experience greater inter- 
need conflict. The outcome would be evident 
in higher manifest anxiety. 


Summary 


The following two hypotheses were tested 
in the present study: (a) the Ss scoring at 
the two extremes on the Dep subscale of the 
BIS will differ significantly in anxiety level 
from Ss scoring in the intermediate range; 
and (b) Ss scoring in the intermediate range
will reflect higher anxiety level. Anxiety level 
was measured by the Taylor Manifest Anx- 
iety Scale. Sixty-four male students at the 
University of Arkansas served as Ss. In the 
analysis of results, a 3 × 2 contingency table
was constructed comprising two groups, Extreme
Dep and Intermediate Dep, and three
classifications, High Anxiety, Middle Anxiety,
and Low Anxiety. The over-all χ² was 12.36,
significant at the .01 level of confidence, and 
in the hypothesized direction. The findings 
support the interpretation that the Dep sub- 
scale of the BIS is measuring the motivational 
variable of anxiety. 


Received November 4, 1957. 







References 


1. Farber, I. E. The role of motivation in verbal
   learning and performance. Psychol. Bull.,
   1955, 52, 311-327.

2. Hill, Winfred F. Comments on Taylor’s “Drive
   theory and manifest anxiety.” Psychol. Bull.,
   1957, 54, 490-493.

3. Jessor, R., & Hammond, K. R. Construct Validity
   and the Taylor Anxiety Scale. Psychol.
   Bull., 1957, 54, 161-170.

4. Kausler, D. H., & Little, N. D. The BIS dependency
   scale and grades in psychology courses.
   J. counsel. Psychol., 1957, 4, 223-224.

5. Siegel, L. A biographical inventory for students:
   I. Construction and standardization of the instrument.
   J. appl. Psychol., 1956, 40, 5-10.

6. Siegel, L. A biographical inventory for students:
   II. Validation of the instrument. J. appl.
   Psychol., 1956, 40, 122-126.

7. Taylor, Janet A. A personality scale of manifest
   anxiety. J. abnorm. soc. Psychol., 1953, 48,
   285-290.





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Development of a Criterion of Research Productivity ¹


Robert E. Stoltz 


Southern Methodist University 


The purpose of this study was to develop a 
criterion of research productivity for a physi- 
cal science research organization where the 
usual, seemingly more objective criteria were 
not present or not applicable (i.e., number of 
publications).


Method 


Subjects. The subjects in this study were all tech- 
nically trained physical science research workers, 
holding at least a bachelor’s degree in one of the 
physical science or engineering fields. The Ss were 
employed in research work covering a wide variety 
of science and engineering fields such as Mechanical 
Engineering, Reactor Metallurgy, Petroleum Chem- 
istry, Ceramic Engineering, Nonferrous and Ferrous 
metallurgy. The raters were all research supervisors 
and technically trained persons. The raters’ position 
in the company was analogous to the highest level of 
line supervision in the typical industrial organization. 
The Ss were all employees of a large midwestern re- 
search institute employing approximately 2,500 per- 
sons, of whom over one third might be considered to 
be technical research persons. The work of this in- 
stitute is devoted largely to contract research; how- 
ever, considerable time is given to pure, theoretical 
projects. 

Development of the scale. Each of 27 research 
supervisors participated in a one and one-half hour 
interview with the investigator. This group of super- 
visors represented 57% of the supervisors at this 
level and constituted a cross section of all divisions 
of the institute. The interviews were conducted along 
a modification of Flanagan’s Critical Incident tech- 
nique (1). Informants were to select and describe 
three persons whom they considered to match the 
following descriptions: Most Productive, Most Non- 
productive and Most Creative. The informants were 
then instructed to compare and contrast these per- 
sons, indicating to the investigator in what ways the 
persons they were describing behaved alike and in 
what ways they behaved differently in their research 
activities. The informants were instructed to use 
their own words, and to describe specific character- 
istics or events and to use whatever dimension for 
comparison or contrast they might wish. 

At the conclusion of the series of interviews, the 
comments of the informants were examined and over 
500 behavior statements were extracted. Eight psy- 


1 The investigator is indebted to the staff of the
Battelle Memorial Institute for their cooperation and 
assistance in conducting this study. 


chologists were each asked to examine the items and 
to group them into whatever categories they felt the 
items represented. The investigator and two of the 
eight psychologists then reviewed the groupings that 
had been made, and finally decided on 15 categories 
into which the majority of the statements could be 
classed. Fifteen statements, or items, were selected 
from each category as most representative of the 
category by the three judges. Twenty-five other 
items, which could not be categorized with any agree- 
ment by the judges, but which were felt to be of 
interest and possible predictive value, were also 
chosen. While a discussion of the categories is neces- 
sary in order to establish the way in which the 
checklist was constructed, the categories themselves 
were not used in this inquiry and will not be re- 
ferred to again. Additional information regarding 
the checklists may be found in an unpublished report 
by Stoltz (2). The items were reviewed again to de- 
termine whether or not they conformed to three re- 
quirements: (a) They must be observable by the 
persons who were to use the checklists; that is, administrative
supervisors; (b) The behaviors mentioned
in the items must be general enough to apply
to persons in diverse fields of research inquiry and 
also general enough to apply to settings other than 
the present one. This was done so that in future 
research the same items might be used on new popu- 
lations and make possible meaningful comparisons; 
(c) The general behavior mentioned in the item must 
have been suggested by a majority of the informants. 
All of the selected items met the above requirements 
or were reworded to conform to them. 

Forty research supervisors and assistant research 
supervisors described 80 research personnel using this 
checklist (entitled the Productive Behavior Check- 
list). It should be pointed out that there is some 
possible contamination of the results as it was found 
necessary to use 15 of the original informants as 
raters in this phase of the study. Each rater was 
asked to describe two specific individuals: the most 
productive and the least productive man they knew. 
Twenty of the raters were instructed to rate the pro- 
ductive man first, the remaining 20 raters were asked 
to rate the least productive man first. Each item was 
to be rated on a five-point scale, with five indicating 
the highest possible rating, according to how well the 
item described the person being rated. The pages in 
the checklists were assembled in random order so 
that any tendency to rate items differentially due to 
serial position in the checklist would tend to be 
randomly distributed. 

The checklists were then collected and the ratings 
for each item were converted into two indices. A 
discrimination index (DI) was computed for each 




item by determining the amount of overlap between 
ratings for the productive and the nonproductive per- 
sons. On a sample of 50 randomly selected items, 
these DI’s correlated .91 with t tests of the significance
of the difference between the means of the two
groups. A preference index (PI) was determined for 
each item by computing the mean rating given the 
item for both the productive and the nonproductive 
group. Two forced-choice type rating scales were 
constructed to use these items and their preference 
and discrimination indices. These scales were titled 
Research Behavior Description I and II (RBD I and 
RBD II). The items within each scale were com- 
posed of tetrads of items from the checklist, with 
each tetrad having two items with discrimination in- 
dices approximately 20 points greater than the dis- 
crimination indices of the remaining two, and with all 
items in the tetrad having nearly identical preference 
indices. RBD I contained 25 tetrads and RBD II 
contained 26 tetrads.²
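The two item statistics can be illustrated with a short sketch. It rests on several assumptions: the ratings are invented; the preference index is read as the mean rating an item received across both groups combined; and, because the overlap formula for the discrimination index is not given here, the sketch shows instead the t statistic with which the DI's were reported to correlate, computed as a pooled-variance t for two independent groups.

```python
import math
from statistics import mean, variance


def preference_index(productive, nonproductive):
    """Mean rating an item received across both rated groups (assumed reading)."""
    return mean(productive + nonproductive)


def t_statistic(productive, nonproductive):
    """Pooled-variance t for the difference between the two group means."""
    n1, n2 = len(productive), len(nonproductive)
    pooled = ((n1 - 1) * variance(productive)
              + (n2 - 1) * variance(nonproductive)) / (n1 + n2 - 2)
    return (mean(productive) - mean(nonproductive)) / math.sqrt(pooled * (1 / n1 + 1 / n2))


# Hypothetical five-point ratings of a single checklist item.
productive = [5, 4, 5, 4, 5]
nonproductive = [2, 3, 2, 1, 2]
print(preference_index(productive, nonproductive))
print(t_statistic(productive, nonproductive))
```

Items with similar preference values but clearly different discrimination values are the raw material for a tetrad: two items that separate the groups well and two that do not, all about equally attractive to the rater.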

Administration of the scale. Twenty-six divisions 
of the organization were selected to furnish raters for 
the study. These divisions constitute the principal 
units of organization of the institute. A division is 
to a great extent a functionally independent unit. A 
division comes into existence to handle a particular 
type of research problem and remains a division so 
long as that area of research remains important. If 
research activities in the area become increased, or if 
a particular aspect of the original problem becomes
important enough to demand its own organization, 
the division enlarges or subdivides. Divisions range 
in size from 12 to 15 technical persons to over 100 
technical persons. The division titles reflect the ac- 
tivity of the division. For example, some of the 
divisions covered in this study were Industrial Eco- 
nomics, Physical Chemistry, Ceramics, Nonferrous 
Physical Metallurgy, Light Metals, and Physical 
Metallurgy. Each division was sent six copies of
RBD I and six copies of RBD II. Each division was
also given a list of three men who were to be rated. 
These men were selected by the members of the 
personnel department on a random basis. The divi- 
sion supervisor was asked to select two supervisors 
to rate each man using both RBD I and RBD II. 
Following each rating the supervisor doing the rating 
was to indicate on each scale his over-all evaluation 
of the man being rated on a five-point scale, where 
five was the highest rating possible. 

At the end of a three-week period, 139 RBD I and 
136 RBD II forms had been returned out of the 156 
copies of each form that had been distributed. It 
had previously been decided to use for analysis only 
those forms returned by this deadline. A score for 
each person rated was determined for each form by 
counting the number of times a rater had selected a 
higher discriminating item from each tetrad. An 
average RBD I rating was determined for each per- 
son rated by averaging the RBD I scores from his 
two raters. An average RBD II rating was determined
for each person in the same manner. An over-all
evaluation was determined for each person rated
by averaging the ratings given him on the five-point
scale by each of the two raters. Since in some cases
there were data from only one rater, the Ns in the
accompanying table will not be identical. In addition,
the over-all ratings were quite negatively skewed
and were consequently converted to normalized T
scores. Intercorrelations between the possible combinations
were computed and are shown in Table 1.

2 Copies of the checklists or RBD I and II may be
obtained from the author.


Table 1

Research Behavior Description Reliabilities
and Validities

Scale                                  Coefficient

Interrater Reliability
  RBD I                                    .31
  RBD II                                   .62
  RBD I avg. vs. RBD II avg.               .74
  Five-Point Rating Criterion              .70

Validity
  RBD I                                    .48      >.01
  RBD II                                   .60      >.01
  RBD I Average                            .55      >.01
  RBD II Average                  59       .74      >.01

Note. RBD averages were correlated with person's average
rating on five-point criterion scale to determine validity.
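The scoring steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the boolean encoding of a rater's tetrad choices is hypothetical, and the mid-rank percentile rule used for the normalized T scores is one common convention, not necessarily the one the author used.

```python
from statistics import NormalDist, mean


def form_score(choices):
    """Count the tetrads in which the rater picked a high-discrimination item.

    `choices` holds one boolean per tetrad (a hypothetical encoding).
    """
    return sum(1 for picked_high in choices if picked_high)


def average_rating(scores):
    """Average the scores available for one person (one or two raters)."""
    return mean(scores)


def normalized_t_scores(values):
    """Convert raw values to normalized T scores (mean 50, SD 10).

    Each value is replaced by the normal deviate of its mid-rank
    percentile, so a skewed distribution is forced toward normality.
    """
    n = len(values)
    t_scores = []
    for v in values:
        below = sum(1 for x in values if x < v)
        equal = sum(1 for x in values if x == v)
        percentile = (below + 0.5 * equal) / n  # always strictly inside (0, 1)
        z = NormalDist().inv_cdf(percentile)
        t_scores.append(50 + 10 * z)
    return t_scores
```

A person rated by only one supervisor simply passes a one-element list to `average_rating`, which is why the Ns differ between coefficients.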


Results 


Interrater reliability for RBD I was found 
to be .31. Interrater reliability for RBD II 
was found to be .62 and the reliability of the 
over-all rating on the five-point scale, after nor- 
malizing, was found to be .70. In view of the 
widely different reliability coefficients for the
two forms an internal consistency item analy- 
sis was conducted to determine to what extent 
tetrad content influenced the coefficients. Al- 
though the tetrads had been randomly as- 
signed to the two forms it was found that 
when biserial correlations were computed be- 
tween tetrad scores and total scale scores, nine 
tetrads on RBD I and six tetrads on RBD II 
were found to be insignificantly related to 
total scale score. Moreover, four of the nine 
insignificant tetrads on RBD I had negative 
signs, while only one of the six insignificant 
tetrads on RBD II had a negative sign. It
would appear that the low reliability of RBD 
I is partially accounted for by its containing 
more poorly constructed tetrads. This view
is supported by the fact that when the biserial
correlations were computed between the tet- 
rads on RBD I and the over-all five-point 
rating, essentially the same nine items proved 
to be insignificantly related to the over-all 
rating. 

Further evidence of the reliability of the 
scale was gained by determining the product 
moment correlation between the average RBD 
I and average RBD II ratings on each person 
for whom there are four ratings. This corre- 
lation was found to be .74. 

The validity of the scales was assessed in 
several ways. First, each person’s rating was 
correlated with his over-all rating on each 
form. RBD I was found to have a validity 
coefficient of .48 and RBD II to have a coef- 
ficient of .60. When the person’s average rat- 
ing on each form was correlated with his aver- 
age over-all rating on the five-point scale, 
RBD I showed a coefficient of .55 while RBD 
II showed a correlation of .74. 


Discussion 


RBD II showed quite satisfactory interrater
reliability and validity. RBD I did not
show adequate interrater reliability, but this
was no doubt because that form contained
a much greater number of insignificant,
presumably poorly constructed, tetrads
than did RBD II.

Validity coefficients based on the average
ratings of two raters showed increased validity
over either scale alone, as one might expect.
The correlation between the average ratings 
on the two scales indicates that even with the 
unreliability of RBD I, the two scales are 
measuring much the same thing. 
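One way to see why averaging two raters should help is the Spearman-Brown formula for the reliability of a mean of parallel ratings. The study itself does not report this calculation; the sketch below is an outside illustration that assumes the two raters behave as parallel measures, applied to the interrater reliability of .62 reported for RBD II.

```python
def spearman_brown(r_single, k=2):
    """Predicted reliability of the mean of k parallel ratings,
    given the reliability r_single of one rating."""
    return k * r_single / (1 + (k - 1) * r_single)


# Interrater reliability reported for RBD II, stepped up to two raters.
rbd2_two_raters = spearman_brown(0.62, k=2)
```

Under that assumption the two-rater average would have a reliability of roughly .77, which is in line with the higher validity found for averaged ratings.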

It appears that it is possible to develop a 
forced-choice rating scale to determine the 
extent of a person’s productive research be- 
havior, at least in so far as it is evaluated by 
his superior. The extent to which ratings on 
the RBD scales correlate with other measures 
of research productivity is not yet known. 
Neither is it known to what extent subordi- 
nates’ evaluation of productivity might agree 
with superiors’ evaluations. All of these problems
are yet to be answered by future applications
of the scales and their derivatives.

Perhaps the most interesting fact of the 
entire study is that the rating scales predicted 
as well as they did when one considers the 
heterogeneous character of the divisions used 
in the study. It would appear to indicate that 
if the rating forms are evaluating productive 
behavior, the same type of behavior is given 
merit in one division as in another even 
though the activities of the divisions may be 
quite different. There appear to be two possible,
perhaps related, explanations. One,
that a person’s productivity is influenced most 
by characteristic work habits or attitudes that 
the person shows in all challenging activities 
and is somewhat independent of job knowl- 
edge. Or two, that supervisors are all re- 
sponding to a stereotype of the productive per- 
son that they reward in each division and 
which may have nothing to do with over-all 
productivity in a more objective sense. The 
present study does not indicate which of these 
two interpretations is correct. One would 
prefer the former explanation if one’s concern
is in developing and understanding the productive
process; but if our concern is predicting
the ability of a person to rise in company
esteem and status, the latter is of just as much 
benefit. 


Summary 


A forced-choice rating scale designed to de- 
termine the extent of a person’s productive 
research behavior was developed at a Mid- 
western physical science foundation. Of the 
two experimental scales developed the better 
form showed an interrater reliability coef- 
ficient of .62 and a validity of .60. When the 
ratings of two raters were averaged the validity 
of the scale increased to .74. 


Received December 16, 1957. 


References 


1. Flanagan, J. C. The critical incident technique. 
Psych. Bull., 1954, 51, 327-358.

2. Stoltz, R. E. A study of productivity in a re- 
search setting. Unpublished doctoral disserta- 
tion, Ohio State Univer., 1956. 





Journal of Applied Psychology 
Vol. 42, No. 5, 1958


Performance as a Function of Control-Display Relations, Positions
of the Operator, and Locations of the Control ¹


Michael Humphries ²


University of Toronto, Canada 


The ease with which a particular combina- 
tion of a control and a display can be learned 
and operated is frequently determined by the 
nature of the control-display movement rela- 
tion employed. Operators appear to expect, 
on the basis of prior experience, that certain 
control movements should produce certain 
specific display changes (15). If these move- 
ment relationships are not present, perform- 
ance is frequently impaired. These effects 
have been demonstrated on the Two-Hand 
Coordinator (4, 17, 23, 25, 29, 32), the Turret
Pursuit Apparatus (17, 21, 22, 33), the Mash- 
burn Apparatus (1, 2, 10, 17, 18, 19, 20, 26), 
the Toronto Complex Coordinator (15, 31), 
and for a variety of levers (6, 11, 13, 19, 24), 
rudder pedals (13, 14, 24) and knobs (7, 8, 
12). The basic conclusion from these experi- 
ments is that the movement of the control 
should be in the same plane and in the same 
direction as the resulting display movement 
in order to obtain the highest performance 
level and the fastest learning (5, 16). 

Although the directions and planes of move- 
ment of the two elements should be consistent 
in most cases, there are a few situations in 
which this generalization does not apply. In 
systems employing cranks or knobs it is dif- 
ficult to define consistency since rotary move- 
ment of a crank or knob is converted into 
linear movement on the display. This is the 
case on most two-hand coordinators. Also, 
in some simple tracking tasks using levers 
and where the operator does not work under 
stress conditions, the principle of consistency 
does not apply; controls can be connected to
the display so that the movement of the control
is inconsistent with display movement
without impairing performance level (9, 27,
28, 34). In view of these limitations the remainder
of the present work should be considered
in terms of the situations where only
relatively complex tasks are used and where
linear control movement results in linear display
change.


1 This research was conducted at the University of
Toronto under Research Grant No. 9401-02 from the
Defence Research Board of Canada.

This paper represents part of a dissertation submitted
to the University of Toronto in 1957 in partial
fulfillment of the requirements for the Ph.D. degree.
The writer wishes to acknowledge the assistance of
A. H. Shephard of the University of Toronto.

2 Now with the Defence Research Medical Laboratories,
Toronto, Canada.


It has also been demonstrated that the posi- 
tion of the operator relative to the display is 
an important factor in determining the level 
of performance. In an experiment soon to be 
reported, the operator was seated to one side 
of the display so that his frontal plane was 
at right angles to the frontal plane of the 
display. Under these conditions, with the 
control in front of him the operator had to 
turn his head slightly to the left in order to 
see the display. It was found that this body 
position greatly improved the level of per- 
formance obtained from a control-display rela- 
tionship which normally leads to an extremely 
low level of performance when the S is seated 
directly in front of the display (15). This 
suggested that there might be an interaction 
between the effects of body orientation and 
control-display relations. Since there was 
neither a theory nor sufficient empirical evi- 
dence to suggest what the hypothesized inter- 
action might be, a model was constructed for 
this purpose. 

The model was designed to predict the ef- 
fect on performance of variations in control 
position, body orientation and control-display 
relations for the Toronto Complex Coordi- 
nator (31). Since the model was not com- 
pletely successful and as it is not possible to 
describe it in a brief paper, only the empirical 
findings will be reported at this time. 


Method 
Apparatus 


The apparatus used was the Toronto Complex 
Coordinator (TCC). This unit has been described
in the literature (15, 31); consequently only a brief
description will be given in this paper. 

The S is seated before a vertical panel of 81 light 
assemblies, each consisting of an inner green disc and 
an outer red ring. These lights are covered by an 
opaque glass screen. Only one green disc and one 
red ring can be seen at a time. Position of the ring 
is determined by a stepping switch; that of the green 
disc by the position of the S’s control. The S is re- 
quired to move an airplane-type control stick so 
that the green disc within the ring is illuminated. 
When this has been accomplished, a new ring, or 
target, appears elsewhere on the screen. The task 
in this case is discrete, self-paced tracking. 

The measure of performance used was the number 
of matches made in a five-minute practice period. 
A match is recorded each time the S places the green 
disc within the illuminated target. 


Experimental Variables 


Three classes of independent variables were treated 
in the present work: control position, body orienta- 
tion, and control-display relationships. These three 
factors are discussed below. 

1. Control Positions: The majority of research on 
the TCC has been done under conditions where the 
control stick is perpendicular to the floor. This 
means that there can be no consistency between con- 
trol movement and vertical changes on the display; 
the operator can move the control in the horizontal 
and normal directions, but not in the vertical. For 
example, the “expected” or “standard” control-display 
relationship has required the operator to push the 
stick away from himself to make the green light 
move up the display (15, 31). 

In a number of experiments (5) it has been demon- 
strated that the planes of movement of the control 
and the display should be the same to produce the 
highest level of performance. Consequently it was 
thought that the best level of performance would be 
obtained on the TCC if the control-unit were ar- 
ranged so that both the vertical and the horizontal 
movement relations of the control and the display 
were the same. For this reason the control unit was 
mounted on a stand so that the control stick pro- 
jected towards the S, parallel to the floor and per- 
pendicular to the frontal plane of the display. In 
this way the other variables to be discussed can be 
compared under two positions of the control, one of 
which permits partial consistency in the planes of 
movement, and the other which permits complete 
consistency. 

2. Body Orientation: As mentioned previously, 
body reorientation made a usually difficult task very 
much easier than it was when the operator sat di- 
rectly in front of the display. Since the orienta- 
tion of the S could differentially affect the various 
control-display relationships to be discussed later, it 
was decided that this factor should be systematically 
varied. The three body positions tested are illustrated
in Fig. 1. These positions have been designated
according to the clock-face numerals 3, 6,
and 9.

3. Control-Display Relationships: The nature of 
the electrical connection between the control and the 
display of the TCC makes it possible to arrange a 
large number of control-display relationships. Any 
given arrangement, or task, can be defined by specify- 
ing the relationship between control movement and 
display change, where these directions are defined in 
terms of the display itself, regardless of the position 
of the control unit or of the operator. Although it
is possible to arrange a number of tasks, only four
are to be used in the present work. To be consistent
with previously employed terminology, these four
tasks have been defined in Table 1. In the column
of spatial relationships, the first letter of each pair
refers to the direction of control movement relative
to the display, while the second in each pair refers
to the direction of movement of the green light on
the display. The letters A, T, R, L, U, and D refer
to the directions away, towards, right, left, up, and
down, respectively. For example, in Task A, when
the stick is moved away from the display (A) the
green light moves down the panel (D).


Fig. 1. Plan view showing the position of the control,
display, and three positions of the operator.
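The letter code for spatial relations lends itself to a small lookup. The sketch below is illustrative only: the function names are hypothetical, and `mirror` rests on the assumption that reversing both the control and the display direction describes the same physical linkage, which is how a T-U relation corresponds to the away-down example given for Task A.

```python
# Direction letters as defined in the text.
DIRECTIONS = {
    "A": "away", "T": "towards",
    "R": "right", "L": "left",
    "U": "up", "D": "down",
}

# Each direction paired with its opposite.
OPPOSITE = {"A": "T", "T": "A", "R": "L", "L": "R", "U": "D", "D": "U"}


def decode_relation(pair):
    """Turn a code such as 'A-D' into (control direction, display direction)."""
    control, display = pair.split("-")
    return DIRECTIONS[control], DIRECTIONS[display]


def mirror(pair):
    """Express the same linkage with both movements reversed."""
    control, display = pair.split("-")
    return f"{OPPOSITE[control]}-{OPPOSITE[display]}"
```

For instance, `decode_relation("A-D")` reads as "control away, light down", and `mirror("T-U")` gives the equivalent `"A-D"`.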


Table 1

Control-Display Relationships for Four Tasks on the
TCC with the Control-Stick Mounted
in the Vertical Position

Task        Spatial Relations

A           T-U       R-R
B           T-D       R-L
C           T-L       R-U
D           T-R       R-D



Table 2

Control-Display Relationships for Four Tasks on the
TCC with the Control-Stick Mounted in
the Horizontal Position

Spatial Relations

R-R
R-I
R-L
R-D


The above definitions of the four tasks do not 
apply when the control-unit is mounted on a stand 
due to the difference in vertical movement relations. 
In recognition of this fact, the four tasks with the 
control-unit on the stand are defined in Table 2. 
The meaning of the letters is the same as specified 
above. 


Experimental Design 


The experimental design was a 4 × 3 × 2 factorial
(four tasks, three body positions, and two control 
positions) with 24 independent cells, each cell con- 
sisting of 10 independent observations. This design 
permitted 23 orthogonal comparisons to be made. 
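The arithmetic behind this design can be checked by enumerating the cells. The sketch below assumes only what the text states (four tasks, three body positions, two control positions, ten observations per cell); the labels "floor" and "stand" for the two control positions are shorthand chosen for illustration.

```python
from itertools import product

TASKS = ["A", "B", "C", "D"]
BODY_POSITIONS = [3, 6, 9]            # clock-face designations
CONTROL_POSITIONS = ["floor", "stand"]
OBS_PER_CELL = 10

# Every combination of task, body position, and control position is one cell.
cells = list(product(TASKS, BODY_POSITIONS, CONTROL_POSITIONS))

n_cells = len(cells)                          # number of independent groups
n_obs = n_cells * OBS_PER_CELL                # total observations
df_between = n_cells - 1                      # single-df orthogonal comparisons
df_error = n_cells * (OBS_PER_CELL - 1)       # within-cell degrees of freedom
df_total = n_obs - 1
```

With 24 cells there are 23 degrees of freedom between groups, which is why exactly 23 orthogonal single-degree-of-freedom comparisons are available.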


Procedure 


Each S was given a standard set of verbal in- 
structions. The control-display relation was demon- 
strated and the S was shown how to make a match. 
All Ss were instructed to make as many matches as 
possible. Each S worked continuously for five min- 
utes, on only one of the experimental conditions. 


Results 


The mean number of matches made by each 
group is shown in Fig. 2. This figure shows 
how performance on each task varies with the 
position of the operator. 

From these curves a number of trends are 
apparent: 

1. In Position 6: the levels of performance 
on Tasks C, D, C’, and D’ are similar. The 
levels of performance resulting from practice 
on Tasks B and B’ are slightly two-thirds the 
distance between the levels found for Tasks 
A and A’ on the one hand, and the levels for 
C, D, C’, and D’ on the other. Tasks B and 
B’ produce the same levels of performance; so 
do Tasks A and A’. It is concluded, therefore, 
that the positions of the control used in this 
study do not influence performance on these 
tasks when the operator is seated in Position 6. 


313 


2. In Positions 3 and 9: the levels of per- 
formance on Tasks A and B are lower in Posi- 
tions 3 and 9 than they are in Position 6, 
although the differences between the levels 
remain the same. It is concluded that the 
best performance on these tasks occurs when 
the operator sits facing the display. This is 
not the case for Tasks C and D, however. 
The performance levels for these tasks are in- 
creased by rotation of the operator, but the 
amount of improvement in each case depends 
on the direction of rotation. In Position 9, 
Task C appears to be easier than D, while in 
Position 3 the opposite is true. 

3. There is no apparent effect on perform- 
ance produced by rotation of the operator 
when he works on Tasks A’, B’, C’, or D’, 
since these tasks have essentially the same 
level of performance in Positions 3 and 9 that 
they have in Position 6. 

Statistical analysis of the data illustrated in 
Fig. 2 supports the above generalizations. A 
summary of the analysis of variance is pre- 
sented in Table 3. 
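Each F in the analysis of variance is the mean square for a comparison divided by the error mean square. The sketch below recomputes a few of the ratios as a check; the pairing of mean squares with source labels is an assumption read from the order of the table rows, and the dictionary keys are shorthand for those labels.

```python
MS_ERROR = 405.3458  # error mean square from Table 3

# Mean squares for selected comparisons (pairing with labels is assumed).
MEAN_SQUARES = {
    "T'": 13996.8000,
    "T'''": 148553.5042,
    "DL T''": 2387.1125,
    "DQ T'''": 73112.0333,
    "P T'''": 143521.5042,
    "DQ P T'''": 66646.5333,
}


def f_ratio(ms_effect, ms_error=MS_ERROR):
    """F for a single-df comparison: effect mean square over error mean square."""
    return ms_effect / ms_error


f_ratios = {name: round(f_ratio(ms), 2) for name, ms in MEAN_SQUARES.items()}
```

Under this pairing the recomputed ratios agree with the printed F values (34.53, 366.49, 5.89, 180.37, 354.07, and 164.42).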








Fig. 2. Performance as a function of control-display
relations, positions of the operator and locations
of the control. (Abscissa: body positions.)


Table 3

Summary Analysis of Variance Table of the Effects of Four Tasks, Three Positions of the
Operator and Two Positions of the Control

Source                                        df      Mean Square          F

Body Position (D)
  DL (Linear)                                  1           4.2250
  DQ (Quadratic)                               1          39.6750

Stand vs. Floor (P)                            1          47.7042

Tasks (T)
  T’ ((A+A’)-(B+B’))                           1      13,996.8000      34.53**
  T’’ ((C+C’)-(D+D’))                          1          91.8750
  T’’’ ((A+A’)-(B+B’))-((C+C’)-(D+D’))         1     148,553.5042     366.49**

DL P                                           1         354.0251
DL T’                                          1          12.0125
DL T’’                                         1       2,387.1125       5.89*
DL T’’’                                        1         518.4000
DQ P                                           1          69.0083
DQ T’                                          1         397.8375
DQ T’’                                         1          57.0375
DQ T’’’                                        1      73,112.0333     180.37**
PT’                                            1          22.5333
PT’’                                           1         185.0083
PT’’’                                          1     143,521.5042     354.07**

DL PT’                                         1         255.6125
DL PT’’                                        1       1,320.3125       3.26
DL PT’’’                                       1       3,240.0000       7.90**
DQ PT’                                         1          87.6042
DQ PT’’                                        1         250.1042
DQ PT’’’                                       1      66,646.5333     164.42**

Error                                        216         405.3458

Total                                        239

* Significant at the 5% level.
** Significant at the 1% level.


Discussion


The results of the present experiment indicate
that the difficulty of a perceptual-motor
task on a given unit of apparatus is, in part, a
function of the control-display relationship,
the position of the control, and the position of
the S with respect to the display. The effects
on performance of various combinations of
some values of these three classes of variables
indicate that these factors interact with one
another. It was observed that some tasks are
made more difficult while others are made
easier by rotation of the S away from a position
directly in front of the display. This indicates
that knowledge of the control-display
relationship alone is not sufficient to predict
the relative levels of difficulty.

Also, it is not possible to predict the effects 
of body rotation on the various tasks without 
considering the position of the control. 

These observations have serious implica- 
tions for equipment design. Under conditions 
where the S regards the display over his 
shoulder, and perhaps also at other angles of 
regard, the best level of performance and the 
greatest facility in learning can be achieved
by using somewhat unconventional control-display
relationships. This conclusion probably
holds only for certain types and positions
of controls and displays. Consequently, until 
others have been tested the present results 
should not be used as the basis for general 
design principles. 


Summary 


Previous research has shown that control- 
display relations, position of the control and 
position of the operator are important factors 
determining the level of performance. Since 
these variables have usually been studied in 
isolation, an experiment was designed to in- 
vestigate the interaction between them. 

Twenty-four groups of male Ss practiced 
for five minutes on the Toronto Complex Co- 
ordinator. Each group worked on only one 
of the 24 combinations of experimental condi- 
tions. The results indicate that for the same 
apparatus, knowledge of the control-display 
relations alone is not sufficient to predict the 
relative levels of performance. 


Received December 30, 1957. 


References 


1. Adams, J. A. The evaluation of “difficulty of
task” under different conditions of performance
on the modified Mashburn Apparatus. Office
of Naval Research, Special Devices Center, Port
Washington, L. I., N. Y., 1949. (Tech. Rep.
SDC 57-2-8.)

2. Adams, J. A. The influence of the time interval
after interpolated activity on psychomotor
performance. Air Training Command, Hum.
Resour. Res. Cent., Res. Bull. 55-11. San Antonio,
Texas: Lackland AFB, 1952.

3. Andreas, B. G. Bibliography of perceptual-motor
performance under varied display-control relationships.
Rochester, N. Y.: Univer. Rochester,
1953. (Contract AF 30(602)-200, Sci. Rep.
No. 1.)

4. Andreas, B. G., Green, R. F., & Spragg, S. D. S.
Transfer effects between performance on a
following tracking task (modified SAM Two-Hand
Coordinator Test) and a compensatory
task (modified SAM Two-Hand Pursuit Test).
J. Psychol., 1954, 37, 173-183.

5. Andreas, B. G., & Weiss, Bernard. Review of research
on perceptual-motor performance under
varied display-control relationships. Rochester,
N. Y.: Univer. Rochester, 1954. (Contract
AF 30(602)-200, Sci. Rep. No. 2.)

6. Bilodeau, E. A. Modifications of directions of
movement preference with independent variation
of two stimulus dimensions. Air Training
Command, Hum. Resour. Res. Cent., Res.
Bull. 51-12. San Antonio, Texas: Lackland
AFB, 1951.

7. Carter, L. F., & Murray, N. L. A study of the
most effective relationships between selected
control and indicator movements. In P. M.
Fitts (Ed.), Psychological research on equipment
design, Chap. 10. Washington, D. C.:
U. S. Government Printing Office, 1947. (AAF
Aviat. Psychol. Prog. Res. Rep. No. 19.)

8. Fitts, P. M., & Simon, C. W. Some relations between
stimulus patterns and performance in a
continuous dual-pursuit task. J. exp. Psychol.,
1952, 43, 428-436.

9. Foxboro Company. A study of factors determining
accuracy of tracking by means of handwheel
control. Report to the Services No. 71.
Washington, D. C.: Division 7, National Defence
Research Committee, 1942. (Office of
Scientific Res. and Develpm. Rep. No. 3451.)

10. Gagne, R. M., Baker, Katherine E., & Wylie,
Ruth C. The effects of an interfering task on
the learning of a complex motor skill. Port
Washington, L. I., N. Y.: Office of Naval Research,
Special Devices Cent., 1949. (Tech.
Rep. SDC 316-1-9.)

11. Gardener, J. F. Direction of pointer motion in
relation to movement of flight-controls, cross-pointer
type instrument. USAF, Air Materiel
Command, Dayton, Ohio, 1950. (AF Tech.
Rep. No. 6016.)

12. Gibbs, C. B. Transfer of training and skill assumptions
in tracking tasks. Quart. J. exp.
Psychol., 1951, 3, 99-110.

13. Grether, W. F. Efficiency of several types of
control movements in the performance of a
simple compensatory pursuit task. In P. M.
Fitts (Ed.), Psychological research on equipment
design, Chap. 17. Washington, D. C.:
U. S. Government Printing Office, 1947. (AAF
Aviat. Psychol. Prog. Res. Rep. No. 19.)

14. Grether, W. F. Direction of control in relation
to indicator movement in one-dimensional
tracking. AAF Hdqrs., Air Materiel Command,
Engin. Div., 1947. (Memorandum Rep.
TSEAA-694-4G.)

15. Humphries, M., & Shephard, A. H. Performance
on several control-display arrangements as a
function of age. Can. J. Psychol., 1955, 9,
231-238.

16. Humphries, M., & Fletcher, N. C. A classification
of tasks in motor learning. Can. J. Psychol.,
in press.

17. Lewis, D., & McAllister, Dorothy E. An investigation
of individual susceptibility to interference.
Office of Naval Res., Special Devices
Cent., Port Washington, L. I., N. Y., 1950.
(Tech. Rep. SDC 938-1-10.)

18. Lewis, D., McAllister, Dorothy E., & Adams, J.
A. Facilitation and interference in performance
on the modified Mashburn Apparatus: I.
The effects of varying the amount of original
learning. J. exp. Psychol., 1951, 41, 247-260.

19. Lewis, D., Shephard, A. H., & Adams, J. A.
Evidences of association interference in psychomotor
performance. Science, 1949, 110, 271-273.

20. Lewis, D., & Shephard, A. H. Devices for studying
associative interference in psychomotor
performance: I. The modified Mashburn Apparatus.
J. Psychol., 1950, 29, 35-46.

21. Lewis, D., & Shephard, A. H. Devices for studying
associative interference in psychomotor
performance: IV. The Turret Pursuit Apparatus.
J. Psychol., 1950, 29, 173-182.

22. Lewis, D., & Shephard, A. H. Prior learning as
a factor in shaping performance curves. Proc.
nat. Acad. Sci., 1951, 37, 124-131.

23. Lewis, D., & Smith, P. N. Retroactive facilitation
and interference in performance on the
modified Two-Hand Coordinator. Office of
Naval Res., Special Devices Cent., Port Washington,
L. I., N. Y., 1951. (Tech. Rep. SDC
166-00-2.)

24. Loucks, R. B. An experimental study of the effectiveness
with which novices can interpret a
localizer-glidepath approach indicator. United
States Air Force, Air Materiel Command, Dayton,
Ohio. USAF Technical Report No. 5959,
1949.

25. McAllister, Dorothy E. Retroactive facilitation
and interference as a function of level of learning.
Amer. J. Psychol., 1952, 65, 218-232.

26. McAllister, Dorothy E., & Lewis, D. Single-trial-per-task
versus multiple-trial-per-task in
the acquisition of skill in performing several
similar tasks. Proc. Iowa Acad. Sci., 1950, 57,
407-416.

27. Mitchell, M. J. H. Direction of movement of
machine controls: III. A two handed task in a
discontinuous operation. Cambridge, England:
Psychol. Lab., Ministry of Supply, 1947. (SM
10018 (S).)

28. Mitchell, M. J. H. Direction of movement of
machine controls: IV. Right- or left-handed
performance in a continuous task. Cambridge,
England: Medical Research Council, Unit of
Applied Psychology, 1948. (MRC 48/371,
APU No. 85.)

29. Norris, Eugenia B., & Spragg, S. D. S. Performance
on a following tracking task (modified
SAM Two-Hand Coordinator Test) as a
function of the planes of operation of the controls.
J. Psychol., 1953, 35, 107-117.

30. Shephard, A. H. Losses of skill in performing
the standard Mashburn task arising from different
levels of learning on the reversed task.
Office of Naval Res., Special Devices Cent.,
Port Washington, L. I., N. Y., 1950. (Tech.
Rep. SDC 938-1-9.)

31. Shephard, A. H. The Toronto Complex Coordinator.
Canad. J. Psychol., 1955, 9, 7-14.

32. Shephard, A. H., & Lewis, D. Devices for studying
associative interference in psychomotor performance:
II. The modified Two-Hand Coordinator.
J. Psychol., 1950, 29, 53-66.

33. Speeth, W., & Lewis, D. The effects of alternating
practice on the performance of two antagonistic
motor tasks. Office of Naval Res.,
Special Devices Res. Cent., Port Washington,
L. I., N. Y., 1950. (Tech. Rep. SDC 938-1-6.)

34. Vince, M. A., & Mitchell, M. J. H. Direction of
movement of machine control (continued).
Cambridge, England: Ministry of Supply,
Psychological Lab., 1946. (SM 2861 (S).)





Journal of Applied Psychology
Vol. 42, No. 5, 1958


The Postwar Occupational Adjustment of Emotionally Disturbed 
Soldiers 


John B. Miner ¹


Personnel Research Division, The Atlantic Refining Company 


and James K. Anderson 


Conservation of Human Resources Project, Columbia University 


Job applicants with a history of severe emo- 
tional disturbance have recently become a 
major problem for those concerned with 
personnel selection. Some employers have 
adopted an explicit policy forbidding the hir- 
ing of people known to have such a history 
on the assumption that they do not represent 
good employment risks. Probably the ma- 
jority, however, have established no specific 
policy, preferring to leave the matter to the 
discretion of individual employment inter- 
viewers. Prior to World War II the problem 
of establishing policy was, in fact, rarely 
raised, since interviewers found it very diffi- 
cult to obtain specific information from ap- 
plicants. 

Since the war it has become common prac- 
tice to examine the discharge papers of ap- 
plicants, or at least inquire about the condi- 
tions under which a man was separated from 
military service. Consequently, many per- 
sonnel managers have become increasingly 
aware of the very considerable incidence of 
mental disease in the population. Unfortu- 
nately, however, little information is avail- 
able as to the subsequent work histories of 
men who have suffered from severe emotional 
disturbances. Employers, caught between a 
very definite social pressure to hire all dis- 
abled veterans and their own misgivings as to 
the value of men with a history of neurosis or 
psychosis as workers, have lacked the factual 
data necessary to making valid policy de- 
cisions. 

The present paper represents an attempt to, 
at least in part, fill this gap, utilizing data 
gathered in connection with the Conservation 
of Human Resources Project’s study of in- 


1 This research was carried out while the senior 
author was on the staff of the Conservation of Hu- 
man Resources Project, Columbia University. 


317 


effective performance during World War II. 
More specifically an effort is made to investi- 
gate some of the assumptions which, for lack 
of specific information, have on occasion 
guided personnel policy in this area. These 
assumptions which form the hypotheses of 
this study may be stated as follows: 

1. Men with a history of neurosis or psy- 
chosis will reveal their difficulties in occupa- 
tional adjustment by having a higher inci- 
dence of unemployment than other men from 
similar backgrounds. 

2. Men with a history of neurosis or psy- 
chosis will reveal their difficulties in occupa- 
tional adjustment by failing to obtain jobs at 
as high a level as other men from similar 
backgrounds. 


Procedure 


The Ss for the study were selected from a special 
sample consisting of five per cent of all enlisted men 
separated from the army during and immediately 
after World War II. Certain information on the 
men in the five per cent sample including date of in- 
duction and reason for separation had been punched 
on I.B.M. cards and these cards were retained at the 
Adjutant General’s Office in Washington. From this 
large sample, all men who had been inducted during 
the last four months of 1942 and who had been pre- 
maturely discharged for either neurosis or psychosis 
were selected. Both the neurotics and the psychotics 
were then divided into six groups, using certain 
length and type of service categories. These cate- 
gories were introduced in order to facilitate analyses 
beyond the scope of the present paper and have been 
described in detail elsewhere (3). The 12 groups 
(six neurotic and six psychotic) were reduced still 
further in size using a random selection process. For 
reasons extraneous to the present study, the number 
of cases included in each of the 12 smaller samples 
was not directly proportionate to the frequencies in 
the five per cent sample. 

Since, for the purposes of the ensuing analyses, 
data on the two diagnostic groups (the neurotics 
and the psychotics) only were desired, a weighting 







process was used to combine the frequencies for the 
12 length and type of service samples. The weight- 
ing factors applied to data from the six neurotic and 
six psychotic samples were derived from the actual 
frequencies present in the five per cent sample. 
Through their use it has been possible to obtain 
composite estimates of the figures that would have 
been obtained if it had been possible to study all 
enlisted men inducted into the army during the last 
four months of 1942 and later discharged prema- 
turely with either a neurotic or psychotic condition. 
The diagnostic composition of these two samples has 
been indicated elsewhere (3). 
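
The weighting step can be illustrated with a short sketch. It assumes that each length and type of service sample contributes its observed proportion, weighted by that category's frequency in the five per cent sample. The function name and every number below are hypothetical and are not taken from the study.

```python
def weighted_percentage(strata):
    """Combine per-sample proportions into one composite percentage.

    strata: list of (weight, n_with_trait, n_in_sample) tuples, where
    weight is the category's frequency in the parent sample (assumed).
    """
    total_weight = sum(weight for weight, _, _ in strata)
    weighted_sum = sum(weight * hits / n for weight, hits, n in strata)
    return 100.0 * weighted_sum / total_weight


# Hypothetical illustration with two categories only (the study used six).
example = [(100, 5, 10), (300, 2, 20)]
print(weighted_percentage(example))
```

A category that is heavily represented in the parent sample thus dominates the composite even when few of its cases were drawn for study.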

In addition to the neurotic and psychotic sam- 
ples, a control group was drawn from the original 
five per cent sample. This group consisted of en- 
listed men inducted in the last four months of 1942 
who were subsequently honorably discharged from 
the army at the end of the war as a result of de- 
mobilization. The Ss included in the control group 
were selected so as to individually match every other 
man in the neurotic sample on four variables: edu- 
cation, race, age, and county of residence. It was 
not possible to obtain perfect matching on the latter 
variable, but every effort was made to keep devia- 
tions to a minimum. Thus, if a man from a rural 
county in Georgia could not be matched with an- 
other man from the same county, he was at least 
matched with a man from another similar county in 
his own or an adjacent state. The results of the 
matching on education, race, and age are presented 
in Tables 1, 2, and 3 respectively. These tables also 
contain distributions for the psychotic sample, which 


Table 1 

Percentage of Neurotics, Psychotics, and Controls 
Attaining Various Educational Levels 
Prior to Army Induction 

                 School Years Completed 
Group         0-6    7-8    9-11    12     13+ 

Neurotics    18.7   23.6   28.0   25.3    4.4 
Psychotics   25.2   18.3   31.3   17.4    7.8 
Controls     18.6   15.7   31.4   30.4    3.9 


Table 2 

Percentage of Neurotics, Psychotics, and Controls 
of White and Negro Race 

[Table body lost in extraction; column heading: Race (White, Negro).] 


John B. Miner and James K. Anderson 


Table 3 

Percentage of Neurotics, Psychotics, and Controls of 
Various Ages at Army Induction 

Group          N    18-19   20-24   25-29   30-34   35+ 

Neurotics     225    7.6    55.6    17.3    10.2    9.3 
Psychotics    115    9.5    58.3    17.4     8.7    6.1 
Controls      102    8.8    55.9    22.5     9.8    3.0 


did not serve directly as a basis for the selection of 
the control group. Nevertheless, neither the psy- 
chotics nor the neurotics differ reliably from the con- 
trols on any of these three variables. Chi-square 
analysis using the .05 level as a criterion of adequacy 
indicates that the matching process was satisfactory. 
The N values noted in these tables require some 
further explanation. It was not possible to obtain 
adequate follow-up information on the employment 
status of all Ss. Postwar information was available 
on 88.7 per cent of the 115 controls, 97.4 per cent of 
the 231 neurotics, and 95.8 per cent of the 120 psy- 
chotics. These smaller numbers have been employed 
in this and all ensuing analyses. Since the total neu- 
rotic sample rather than the half which served as 
a basis for matching has been utilized, the neurotics 
and controls do not appear as perfectly matched. 
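
The N values can be checked against the follow-up percentages just given. The sketch below only redoes that arithmetic; rounding to the nearest whole man is an assumption.

```python
# (original sample size, per cent with follow-up information), as reported in the text
samples = {
    "controls": (115, 88.7),
    "neurotics": (231, 97.4),
    "psychotics": (120, 95.8),
}


def followed_up(original_n, per_cent):
    """Number of Ss with usable postwar information, rounded to a whole man."""
    return round(original_n * per_cent / 100.0)


effective_n = {group: followed_up(n, pct) for group, (n, pct) in samples.items()}
print(effective_n)
```

The results agree with the N column of Table 3 (102 controls, 225 neurotics, 115 psychotics).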
Extensive information on all Ss was obtained from 
the files of the Army Records Center in St. Louis. 
These files contain service records, medical records, 
and separation board proceedings. In addition, con- 
siderable follow-up data have been accumulated. 
The VA files were checked through 1953. Many of 
the Ss obtained disability compensation and medical 
treatment from the VA. Others have received ex- 
tensive education and training, frequently coupled 
with vocational guidance interviews and testing. As 
a result of these contacts, the VA has built up a 
rather complete dossier on the postwar employment 
of these men. When the VA data were, for some 
reason, incomplete, a situation which was most likely 
to arise among the control Ss, they were supple- 
mented through the use of questionnaires mailed to 
the veterans’ homes. These questionnaires, which 
inquired specifically into a man’s occupational and 
educational adjustment, were mailed during the latter 
part of 1954. It was only in those few instances 
when an S’s VA file contained no information on 
postwar employment and it was impossible to con- 
tact him by mail that a man had to be eliminated 
from the sample because of inadequate information. 
For the purposes of the present analyses, only the 
most recent available data have been employed. A 
man was categorized as a student, unemployed, or 
working as of the last date such information was ob- 
tained, whether from the VA files or the question- 
naire. These terminal dates are indicated in Table 4. 
Although the information on the controls tends to 





Adjustment of Emotionally Disturbed Soldiers 


Table 4 

Year of Last Available Follow-up Information for 
Neurotics, Psychotics, and Controls 

             Neurotics   Psychotics   Controls 
Year            (%)         (%)         (%) 

1945-46         3.1         4.2         1.0 
1947-48         4.1         7.0         5.9 
1949-50        12.4         7.0         6.9 
1951-52        26.2        20.9         8.8 
1953-54        54.2        60.9        77.4 


be somewhat more recent than that on the psychi- 
atric groups, the controls were also discharged from 
the Army later. Thus, the period since leaving mili- 
tary service may be considered as of essentially the 
same duration for all groups. 

In order to investigate the first hypothesis, esti- 
mates of the extent of unemployment among pre- 
maturely discharged neurotics and psychotics were 
obtained using the weighting process previously de- 
scribed. The control group figure was derived by 
applying the six weighting factors developed for use 
with the neurotics to the control data as well. The 
incidence of unemployment among the neurotics and 
psychotics was compared with that for the matched 
controls using chi-square techniques. 

The investigation of the second hypothesis requires 
a specification of the occupational level attained by 
those men who were employed at the time of fol- 
low-up. For this purpose the system of classification 
according to intellectual demand described in Intelli- 
gence in the United States (2) was used. The sys- 
tem assigns each occupation to one of four levels, 
depending on the intelligence required for adequate 
performance. The levels may be generally described 
as follows: 


Level 1: High level professional workers, execu- 
tives, those involved in the most tech- 
nical and complex clerical and sales work, 
and large scale farmers. 

Level 2: Retail managers, the more highly skilled 
workers, skilled clerical workers, fore- 
men, wholesale salesmen, technicians, 
lower level professional workers, and 
relatively large scale farmers. 

Level 3: Lower level skilled workers, the semi- 
skilled, routine clerical workers, retail 
sales clerks, most farmers, and proprietors 
of relatively simple businesses. 

Level 4: Unskilled workers. 


After the occupational level of all employed mem- 
bers of the neurotic, psychotic, and control samples 




had been established and the distributions weighted,² 
the values for the neurotics and psychotics were com- 
pared with those for the controls. In addition, the 
number of neurotics and psychotics employed at 
each level was compared with similar figures obtained 
as a result of categorizing the occupations found in 
a representative sample of the employed population 
as of 1953. Detailed descriptions of this sample, 
which contained 745 Ss, have been presented else- 
where (2, 4). 

A second analysis was also carried out using the 
total neurotic, psychotic, and control samples. In 
this instance the occupational level of the last known 
job held by those men who were either students or 
unemployed at the time of follow-up was determined 
and these occupational distributions added to those 
of the employed. Comparisons between groups were 
then made in the same manner as previously indi- 
cated. 


Results 


The incidence of unemployment in the rep- 
resentative sample of neurotics inducted in 
late 1942 is 13.4 per cent. This figure com- 
pares with a control value of 7.4 per cent. 
The difference is not reliable (χ² = 2.58, P > 
.10). Among the psychotics, on the other 
hand, unemployment is quite prevalent with 
48.9 per cent out of work at the time of fol- 
low-up. When compared with the controls, 
this figure yields a highly reliable difference 
(χ² = 44.77, P < .01). Thus, the results ob- 
tained with the psychotic group lend consid- 
erable support to the first hypothesis. There 
is no clear-cut evidence, however, that the 
incidence of unemployment among former 
neurotics is any higher than among other men 
from similar backgrounds. 
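
The two unemployment comparisons can be approximated from the figures given in the text. The sketch below is not the authors' computation. It assumes the N values of Table 3 as denominators, converts the reported percentages back into approximate frequencies, and applies the ordinary Pearson chi-square for a 2 × 2 table without continuity correction. Because the published values rest on weighted frequencies, the sketch lands close to, but not exactly on, the reported 2.58 and 44.77 (roughly 2.5 and 45).

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


def unemployment_table(n_group, pct_group, n_control, pct_control):
    """Approximate cell frequencies from group sizes and unemployment percentages."""
    unemployed_group = n_group * pct_group / 100.0
    unemployed_control = n_control * pct_control / 100.0
    return (unemployed_group, n_group - unemployed_group,
            unemployed_control, n_control - unemployed_control)


# Group sizes from Table 3; percentages from the Results text.
neurotic = chi_square_2x2(*unemployment_table(225, 13.4, 102, 7.4))
psychotic = chi_square_2x2(*unemployment_table(115, 48.9, 102, 7.4))
print(round(neurotic, 2), round(psychotic, 2))
```

With one degree of freedom the .05 and .01 critical values are 3.84 and 6.63, so the neurotic comparison falls short of reliability while the psychotic comparison far exceeds it.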

When the occupational distribution of the 
employed neurotics is compared with that of 
the controls (Table 5), a highly reliable dif- 
ference is obtained (χ² = 13.51, P < .01). 
Detailed analysis indicates that this results 
from an underconcentration of neurotics in 
Level 2, skilled occupations, and a cor- 
responding increase in Level 4, unskilled 
workers (χ²s = 6.76 and 6.71 respectively, 
P < .01 in both cases). When a similar 
analysis using the employed population as of 


2 As with the unemployment figure, weighting of 
the control distribution was carried out using the six 
factors developed for use with the neurotics. The 
use of the factors derived from data on the incidence 
of psychosis within the five per cent sample would 
not have brought about an appreciable change in the 
results. 







Table 5 

Percentage of the Employed Neurotics, Psychotics, and 
Controls and of the Employed Population 
Working at Each Occupational Level 

                                 Occupational Level 
Group                    N       1      2      3      4 

Neurotics*              193     4.6   21.2   48.7   25.5 
Psychotics*              56     7.6   30.6   40.5   21.3 
Controls*                90      .6   35.6   51.8   12.0 
Employed Population     745    12.2   31.7   38.4   17.7 

* Based on weighted data. 


1953 is carried out, this tendency to lower 
level employment among the neurotics is even 
more pronounced (χ² = 22.96, P < .01). The 
neurotics are reliably below the population in 
Level 1 and Level 2 employment and reliably 
above at Levels 3 and 4 (χ²s = 9.38, 8.12, 
6.84, and 5.93, all with P < .01 except at 
Level 4 where P is < .02). The difference 
between the findings obtained with the 
matched controls and the employed popula- 
tion is attributable to the relatively greater 
incidence of Level 1 employment in the popu- 
lation as a whole and the decreased propor- 
tion of Level 3 workers. 
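
The neurotic-control comparison of Table 5 can be approximated in the same way for a table with more than two columns. The sketch assumes that frequencies may be recovered as N times the tabled percentage, and that the "detailed analysis" of a single level collapses the distribution to that level against all others; neither assumption is confirmed by the text. Under them the results fall near the reported 13.51, 6.76, and 6.71 without reproducing them exactly, as would be expected with weighted data.

```python
def chi_square(table):
    """Pearson chi-square for a rows x columns table of frequencies."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


def counts(n, percentages):
    """Approximate frequencies from a group size and the Table 5 percentages."""
    return [n * p / 100.0 for p in percentages]


neurotics = counts(193, [4.6, 21.2, 48.7, 25.5])
controls = counts(90, [0.6, 35.6, 51.8, 12.0])

overall = chi_square([neurotics, controls])  # 2 x 4 table, 3 df


def level_vs_rest(level):
    """Collapse to a 2 x 2 table: one occupational level against all others."""
    k = level - 1
    return chi_square([
        [neurotics[k], sum(neurotics) - neurotics[k]],
        [controls[k], sum(controls) - controls[k]],
    ])


print(round(overall, 2), round(level_vs_rest(2), 2), round(level_vs_rest(4), 2))
```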

Similar results are obtained when the last 
known occupation and the total neurotic sam- 
ple form the basis for the analysis (Table 6). 
The neurotics and neurotic controls still differ 
(χ² = 14.38, P < .01) and this difference is 
still attributable to the lower Level 2 fre- 
quency and higher Level 4 frequency among 
the neurotics (χ²s = 6.13, P < .02 and 8.90, 


Table 6 

Percentage of Neurotics, Psychotics, Controls, and 
Employed Population Working at Each 
Occupational Level at the Time of 
Last Known Employment 

                                 Occupational Level 
Group                    N       1      2      3      4 

Neurotics*              225     3.9   20.4   45.3   30.4 
Psychotics*             115     3.6   17.1   29.3   50.0 
Controls*               102      .7   33.1   51.3   14.9 
Employed Population     745    12.2   31.7   38.4   17.7 

* Based on weighted data. 


P < .01, respectively). The neurotic-em- 
ployed population comparison is also reliable 
(χ² = 34.54, P < .01), and again the under- 
concentration of neurotics in Levels 1 and 2 
and the overconcentration at Level 4 occur 
(χ²s = 12.78, 10.68, and 16.93, respectively, 
all with P < .01). A reliable difference no 
longer appears at Level 3. 

In spite of the consistent evidence of lower 
level employment among the neurotics, no 
such result is obtained when the distribution 
of the employed psychotics is compared with 
that of the controls (Table 5). The differ- 
ence is not reliable (χ² = 5.80, P > .10). 
Similarly, the psychotic-employed population 
comparison does not produce evidence of a 
reliable difference (χ² = 1.41, P > .10). 

When, however, the last known occupations 
of the students and unemployed are added to 
the employed distribution (Table 6), a sig- 
nificant chi-square value is obtained (χ² = 
34.52, P < .01). The psychotics are less 
likely to have been employed in Levels 2 and 
3 occupations than the controls and much 
more likely to have been working at Level 4 
jobs (χ²s = 7.53, 11.29, and 29.99, all with 
P < .01). When the psychotics are com- 
pared with the employed population of 1953, 
a similar result is obtained (χ² = 63.19, P < 
.01). The psychotics are underrepresented at 
Levels 1 and 2 and heavily concentrated at 
Level 4 (χ²s = 7.56, 10.10, and 60.49, respec- 
tively, all with P < .01). 


Discussion 


Although the possibility that the lower level 
of employment among the neurotics and the 
high incidence of unemployment among the 
psychotics might be attributable to differences 
in education, race, or age has been ruled out 
through the use of matched controls, it is pos- 
sible that these findings are, at least in part, 
attributable to a restriction of the occupa- 
tional opportunities available to men who 
have received Certificate of Disability Dis- 
charges for emotional reasons. There is rea- 
son to believe, however, that this is not a 
major factor contributing to the results. Very 
rarely do statements appear in either the VA 
records or the questionnaires of men separated 
for neurosis or psychosis indicating that they 







have had difficulty obtaining employment be- 
cause of the nature of their discharges. Fur- 
thermore, the VA records and questionnaires 
contain considerable evidence that the VA fre- 
quently gave special employment assistance to 
these veterans. As a result, a number appear 
to have been hired for positions in industry 
as well as in government which they, in all 
probability, would have been unable to ob- 
tain if they had not been discharged with a 
service-connected disability. Certainly, the 
very marked differences in the postwar ad- 
justment patterns of the neurotics and psy- 
chotics, both of whom received the same type 
of discharge, are not readily explainable in 
terms of restricted occupational opportunity 
due to conditions of separation from military 
service. 

Although the high incidence of unemploy- 
ment among the psychotics results in part 
from the fact that a number of these men 
were undergoing long-term hospitalization at 
the time of follow-up, this is by no means the 
only determinant. Slightly over 65 per cent 
of the psychotic unemployed were not in hos- 
pitals. Among the neurotics 83 per cent of 
the unemployed were not institutionalized at 
follow-up. None of the controls were in hos- 
pitals. It is also clear from the analysis of 
occupational level at the time of last known 
employment that the majority of the unem- 
ployed psychotics last worked at Level 4, un- 
skilled jobs. This appears to be true of 75 
per cent of the psychotics who were not work- 
ing at follow-up. The figure for the neurotics 
is 61 per cent, but of course far fewer were 
out of work. Because of the low number of 
unemployed controls and the difficulties in- 
volved in applying weighting procedures to 
such a small sample, a reliable figure cannot 
be given for this group, but it is almost cer- 
tainly not over 50 per cent, and probably con- 
siderably below that. 

This suggests that there are, among those 
with a history of psychosis, a large number of 
men who drift in and out of unskilled jobs, 
but spend most of their time in an unem- 
ployed status. If this is true, one would ex- 
pect to find a group of individuals who “just 
happened” to be employed at the time of fol- 
low-up included in the Level 4 frequency. A 




check on this assumption reveals that almost 
40 per cent of the psychotics who were em- 
ployed in unskilled jobs at follow-up had a 
history of long periods of unemployment and 
frequent job changes. Among this latter 
group as well as among the unemployed psy- 
chotics who had last worked in Level 4 po- 
sitions, the usual pattern was not a gradual 
or even a precipitous decline in employment 
level. Characteristically, these men had never 
worked in any job above the semiskilled level; 
perhaps the majority, never above the un- 
skilled. 

Within the limits of generalization possible 
from the present samples, several guidelines 
for personnel policy emerge. Although these 
conclusions cannot be assumed to have valid- 
ity in all employment situations, they should 
in most instances contribute to the efficiency 
of a company’s personnel procedures. Ideally, 
however, local studies, which take into ac- 
count variations in the labor market as well 
as company hiring and promotion procedures, 
should be carried out before policy is estab- 
lished. 

Men with a history of psychosis apparently 
divide into two groups of approximately equal 
size. The first, as far as is presently known, 
make quite satisfactory employees. Further 
study may indicate that in spite of the fact 
that they adjust at approximately the same 
occupational level as other men from similar 
backgrounds, they do so at the expense of 
somewhat poorer performance. There is at 
present, however, no evidence to support this 
conclusion. The second group, on the other 
hand, appears to represent a poor employ- 
ment risk. These men, whose histories may 
be expected to contain evidence of consider- 
able unemployment, are not likely to stay on 
the job very long—either they will have to be 
released or they will leave on their own voli- 
tion. They are very likely to have held only 
unskilled positions previously. Any man with 
a history of psychosis, who has worked at un- 
skilled jobs in the past and who has been un- 
employed over considerable periods of time, 
should, under economic conditions similar to 
those of the postwar period, be hired only 
with the full realization that there is a high 










probability of his soon becoming unemployed 
again. 

By way of contrast, the neurotics as a group 
would appear to represent a much better em- 
ployment risk. They are not likely to spend 
long periods among the ranks of the unem- 
ployed. However, men with a history of neu- 
rosis cannot be expected to attain supervisory 
and skilled positions as frequently as other 
employees. An employer who hires such a 
man with the expectation of moving him into 
a higher level position is likely to be disap- 
pointed. This does not imply that the for- 
mer neurotic is necessarily a poor worker. 
The limited evidence we have suggests that as 
a group neurotics are likely to perform just 
as well as other employees, at least in lower 
level jobs (1). It seems probable, however, 
that they do typically lack whatever qualities 
make for placement or continued employment 
in supervisory and skilled positions. 


Summary 


Employment status approximately 10 years 
after discharge was determined for representa- 
tive samples of men inducted into the army 
during late 1942 and prematurely separated 
for neurosis and psychosis. Similar informa- 


tion was obtained on a control group matched 




in terms of education, age, race, and county 
of residence. It was found that nearly half 
of the men with a history of psychosis were 
unemployed at follow-up as compared with 
approximately seven per cent of the controls. 
The neurotics did not have a reliably higher 
incidence of unemployment, but did tend to 
concentrate in low level occupations, particu- 
larly those of an unskilled nature. The pro- 
portion of men working in skilled and super- 
visory positions among the neurotics was con- 
siderably below that of the controls. This 
downward shift in occupational level was not 
present among those psychotics who were em- 
ployed. 


Received January 2, 1958. 


References 


1. Markowe, M. Occupational psychiatry: An his- 
torical survey and some recent researches. J. 
ment. Sci., 1953, 99, 92-101. 

2. Miner, J. B. Intelligence in the United States. 
New York: Springer, 1957. 

3. Miner, J. B., & Anderson, J. K. Intelligence and 
emotional disturbance: Evidence from Army 
and Veterans Administration records. J. ab- 
norm. soc. Psychol., 1958, 56, 75-81. 

4. Tomkins, S. S., & Miner, J. B. The Tomkins- 
Horn Picture Arrangement Test. New York: 
Springer, 1957. 





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Adequacy of the Residual Sensory Cues for Psychomotor Performance 
of Arm Amputees ¹, ² 


Hilde Groth and John Lyman 


University of California, Los Angeles 


Amputation of the upper extremity deprives 
the individual not only of much of his normal 
motor function but also of a large amount of 
the sensory information which regulated this 
motor function. Although artificial arms con- 
stitute a fairly efficient mechanical replace- 
ment for some of the incurred motor loss, ex- 
perimental results indicate that performance 
of visuo-motor tasks with a prosthesis shows 
a considerable decrease of performance speed 
(3). For the purpose of future design modifi- 
cations it is of practical interest to ascertain 
whether this performance decrement should be 
attributed primarily to the loss of sensory in- 
formation and, hence, to inadequate sensory 
regulation of the prosthesis movements or to 
a relatively inadequate motor replacement by 
the prosthetic device. Both of these possi- 
bilities focus on a third alternative, an in- 
crease in “central integration time.” For the 
amputee any psychomotor task is necessarily 
much more complex than for the nonamputee 
since the prosthesis is interposed between the 
muscular motion and the control operation 
as an additional technical link. Aside from 
mechanical delays in the device itself, coordi- 
nation of this extra link into the operating 
cycle conceivably can prolong the reaction 
time. 

The specific purpose of this study was to 
investigate the extent of the usefulness of re- 
maining sensory cues for prosthesis perform- 
ance on two visuo-motor tasks. The sensory 
cues involved consist of a composite of afferent 
information received from the pressure of the 


1 This work was supported by Contract No. VAm- 
23110 between the Veterans Administration and the 
Department of Engineering, University of California, 
Los Angeles. The opinions expressed are those of the 
authors and do not necessarily represent those of the 
Veterans Administration. 

2 The authors wish to thank C. L. Taylor, Project 
Leader, for his cooperation in permitting this re- 
search to be undertaken, and J. R. Zweizig for the 
development of the electronically controlled ap- 
paratus. 


socket on the stump, from the pressure exerted 
by harness straps, from movements of the 
stump and shoulder girdle during prosthesis 
operation and, in certain cases, from move- 
ments of and pressures within the biceps 
cineplastic tunnel as well as an undefined 
amount of visual information.³ For labora- 
tory investigation of the role of these cues a 
manipulation and tracking task was chosen to 
provide information about the following varia- 
bles: 


Task 1—Manipulation Task 
a. Adequacy of object weight perception 
with pressure cues. 
b. Adequacy of response speed on a task re- 
quiring gross motions with the prosthesis. 
c. Problem of “central integration time” in 
a complex task. 


Task 2—Tracking Task 

a. Adequacy of pressure cues for dynamic 
force adjustments. 

b. Adequacy of response speed when no 
gross motions of stump and prosthesis 
are required. 

c. Problem of “central integration time” in 
a complex task. 


Accuracy of weight perception during ma- 
nipulation was considered to be a fair crite- 
rion for the adequacy of the available sensory 
cues. An excessively high level of grasp force 
during manipulation in comparison to the 
level of force exerted by nonamputees on the 
same task would be indicative of too little 
“feel” in the prosthesis. Accuracy of dynamic 
pressure discrimination, i.e., performance of a 
task requiring continuous adjustments of grasp 
force without change in terminal device open- 
ing, by exclusion of gross motions, was ex- 


3 A cineplastic tunnel is a skin-lined tunnel con- 
structed surgically in the biceps or other muscle and 
used as a “muscle motor” to power the control cable 
for prehension. A description has been published 
elsewhere in this journal (3). 







pected to provide an estimate of the pressure 
sensitivity of the cineplastic tunnel or alterna- 
tively of the shoulder girdle areas underneath 
the harness straps. 


Procedure 
Subjects 


Group I: 10 right-handed engineering students per- 
forming with the right hand. 

Group II: 10 right-handed engineering students per- 
forming with the left hand. 

Group III: 8 unilateral below elbow amputees wear- 
ing harness-controlled prostheses, performing 
with the hook. 

Group IV: 4 unilateral below elbow amputees wear- 
ing cineplasty-controlled prostheses, performing 
with the hook. 

Group V: 3 unilateral above elbow amputees wearing 
harness-controlled prostheses, performing with 
the hook. 


All amputees were regular prosthetic hook wearers 
and had served repeatedly as research amputees in 
the laboratory. The students were recruited from 
classrooms. 

Performance of unilateral amputees with the arti- 
ficial limb was considered as performance with the 
nonpreferred limb regardless of which side had been 
amputated. 


Apparatus and Tests 


Manipulation apparatus. The electronically con- 
trolled instrument has been described elsewhere (2). 
Its main components consisted of a 3 × 6 control 
board matrix, a 3 × 6 lightbank display matrix, and 
a thin-walled split aluminum cylinder force trans- 
ducer instrumented with Baldwin Lima SR-4 strain 
gages. The cylinder had a diameter of 5.0 cm., a 
slot opening of 1 mm., and weighed 422 gms. It was 
designed to assure nearly isometric muscle contrac- 
tion during prehension. The apparatus permitted 
simultaneous recording of the integral of prehension 
force over the duration of the trial and of the total 
manipulation time for the trial length. Mean prehen- 
sion force and time per transport were taken as cri- 
terion measures. Task performance required only 
the simple motion elements of grasp, transport and 
release. Task complexity was changed in terms of 
pace of performance and pattern of motion. The ar- 
rangement was placed on a table 30” high. 
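
The two criterion measures follow directly from the recorded quantities. A minimal sketch, assuming that the number of transports in a trial is counted separately and that the force integral is expressed in gm.-sec.; the trial figures are hypothetical.

```python
def criterion_measures(force_integral_gm_sec, trial_time_sec, n_transports):
    """Return (mean prehension force in gm., time per transport in sec.)."""
    mean_force = force_integral_gm_sec / trial_time_sec
    time_per_transport = trial_time_sec / n_transports
    return mean_force, time_per_transport


# Hypothetical trial: 60 sec. of work, 100 transports, integral of 72,000 gm.-sec.
mean_force, time_per_transport = criterion_measures(72000.0, 60.0, 100)
print(mean_force, time_per_transport)
```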

Tracking apparatus. The instrument has been 
described in detail elsewhere (2). A 
“random” target problem was generated on a dual 
beam oscilloscope in the form of a vertically moving 
line across the face. The line was tracked with a 
light dot controlled by prehension force variations. 
The same split cylinder force transducer as was ma- 
nipulated in Task 1 was placed on a support and 
used as a controller. The target line had a maximal 
vertical motion range of ±3.0 cm. from the center of 




the scope face. The tracking dot had its zero posi- 
tion 3.0 cm. below the center and had a maximal 
vertical range of +6.0 cm. The force required for 
tracking with this dot was approximately 165 gm./ 
cm. The apparatus was placed on a table 30” high; 
the force transducer rested on a 10” stand in front of 
the scope. Number of target hits, time on target 
and time on target per hit contributed the criterion 
measures. 
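
The force demands of the tracking task can be estimated from these dimensions. The sketch assumes that dot travel is strictly proportional to prehension force at 165 gm./cm. and that the target moves within 3.0 cm. on either side of the scope center; both are readings of the description above and not stated calibration data.

```python
FORCE_PER_CM = 165.0   # gm. of prehension force per cm. of dot travel (from the text)
DOT_ZERO_CM = -3.0     # dot rest position relative to the scope center (from the text)
DOT_RANGE_CM = 6.0     # maximal upward travel of the dot (from the text)


def required_force(target_cm):
    """Force (gm.) needed to hold the dot on a target at target_cm from center."""
    travel = target_cm - DOT_ZERO_CM
    if not 0.0 <= travel <= DOT_RANGE_CM:
        raise ValueError("target lies outside the range of the tracking dot")
    return FORCE_PER_CM * travel


for position in (-3.0, 0.0, 3.0):
    print(position, required_force(position))
```

On these assumptions a target at the scope center calls for about 495 gm. and a target at the top of its range for about 990 gm.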

Terminal devices for amputees. A left or right 
APRL voluntary closing hook, modified by removal 
of the locking mechanism, was worn by each am- 
putee. These hooks have been described previously 
(3, 4). Voluntary closing hooks were chosen be- 
cause the amputee had to apply control cable tension 
in direct proportion to the prehended load. The 
ratio of control cable force to output force at the hook 
tip was measured as 2.5:1. Prehension 
force of 1,000 gm. thus required a cable pull of about 
2,500 gm. 

The hook tips were covered with commercial rubber 
sleeves in order to obtain similar conditions of surface 
friction for bare hand and hook performance. The 
coefficient of static friction between skin and alumi- 
num was approximately 0.7 and for the rubber 
sleeves it was 0.5. 
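
The friction coefficients bear directly on the least grasp force that will hold the cylinder. A minimal sketch, assuming the simple relation weight = coefficient × grasp force and taking the worked figure in the text (1,000 gm. of prehension force for about 2,500 gm. of cable pull) as the cable relation. It gives approximately 603 gm. for the bare hand and 844 gm. for the hook, in line with the just slip forces of about 604 and 844 gm. noted under Table 1.

```python
CYLINDER_WEIGHT_GM = 422.0   # from the apparatus description
CABLE_TO_GRIP_RATIO = 2.5    # 1,000 gm. of prehension needs about 2,500 gm. of cable pull


def just_slip_force(weight_gm, friction_coefficient):
    """Smallest grasp force (gm.) at which friction still supports the weight,
    assuming weight = coefficient x grasp force."""
    return weight_gm / friction_coefficient


def cable_pull(prehension_force_gm):
    """Control cable tension (gm.) needed for a given prehension force."""
    return CABLE_TO_GRIP_RATIO * prehension_force_gm


bare_hand = just_slip_force(CYLINDER_WEIGHT_GM, 0.7)
hook = just_slip_force(CYLINDER_WEIGHT_GM, 0.5)
print(round(bare_hand), round(hook), round(cable_pull(hook)))
```

The last figure, the cable pull needed merely to keep the cylinder from slipping, is an inference from the two relations and is not reported in the text.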

Experimental noise. Since there was a possibility 
that amputees would respond to some auditory cues 
from the prosthesis, head phones were worn by all 
Ss. A random noise generator provided the mask- 
ing noise. The head phone output was adjusted to 
90 db SPL. 


Routine 


The following six treatments were administered in 
a subject by treatment design during a single ex- 
perimental session: 


1. Pursuit tracking task; duration: 5 min. 

2. Manipulation task, self-paced, regular pattern; 
duration: 10 min. 

3. Manipulation task, self-paced, random pattern; 
duration: 10 min. 

4. Manipulation task, experimenter-paced, regular 
pattern; duration: 10 min. 

5. Manipulation task, experimenter-paced, random 
pattern; duration: 10 min. 

6. Manipulation task, speed trial. 


Treatments 2, 3, 4 and 5 were given in random order. 

Tracking task. The cylinder was resting on a sup- 
port in front of the oscilloscope and S pursued the 
moving target line with the light dot for five minutes. 

Manipulation task. 1. Pace of performance: (a) 
self-paced (SP): S was asked to adjust his pace so 
that he could perform the task comfortably as an 
eight-hour job. (b) Experimenter-paced (EP): the 
stimulus lights changed at 2-sec. intervals. 

2. Movement pattern. (a) Regular pattern. The 
Ss started at the lower left side of the control board 
and placed the cylinder into the adjacent hole, work- 
ing back and forth, toward and away from the body. 
Under the EP condition, the signal for each move 








Psychomotor Performance of Arm Amputees 


was given by the flashing of a small light bulb 
fastened in the center of the upper border of the 
display panel. (b) Random pattern. The Ss placed 
the cylinder into that hole of the control panel which 
corresponded in location to the lighted circle of the 
display matrix. In the SP condition, the display 
light would change to the next position as soon as 
the contact of the preceding move was made. For 
the EP condition, the location of the stimulus light 
would change at two seconds’ intervals. 

3. Speed trial. The Ss were asked to perform five 
rounds of the regular pattern as fast as possible. No 
head phones were worn for this condition. 

Each task was explained in detail and demon- 
strated by E before the experimental run. 


Table 1 

Mean Prehension Forces and Variabilities on Manipulation Task 

                             Nonamputees            Amputees 
                            PF        PF          PF        PF 
Treatments               X̄ (gm.)   s (gm.)    X̄ (gm.)   s (gm.) 

Self-paced 
  Regular pattern         1,022      561        1,478      769 
Self-paced 
  Random pattern          1,174      617        1,359      558 
Experimenter-paced 
  Regular pattern         1,213      254        1,328      522 
Self-paced 
  Speed trial             1,305      898        1,332 

Note. Just slip force: 1. bare hands ~604 gm., 2. hooks ~844 gm. 


Nonamputee Ss grasped the cylinder around the 
upper rim with the pads of all five fingers leaving 
the slot between the thumb and index finger. Ampu- 
tee Ss grasped it with the length of the rubber cov- 
ered hook tips with the slot pointing away. By this 
method we were able to approximate the sizes of the 
contact areas of bare hand and hook performance to 
satisfy the equation, F = P × A, where F = force, P
= pressure, A = area of contact. 

All treatments were administered while the Ss were 
standing. 

Calculations. All comparisons for statistical infer-
ence were made by nonparametric methods since
data of amputee performance showed neither nor-
mality of trait distribution nor homogeneity of vari-
ance (8). The significance level was set at P < .01.
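
The rank computation behind a nonparametric comparison such as the Mann-Whitney U test reported in the Results can be sketched in Python as follows. The function name and the sample force values are hypothetical and serve only as illustration; they are not the study's data. The sketch stops at the U statistic and leaves the comparison against tabled critical values at the chosen significance level to the reader.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) for two independent samples, using midranks for ties."""
    pooled = sorted(
        [(value, 0) for value in sample_a] + [(value, 1) for value in sample_b]
    )
    rank_sum_a = 0.0
    i = 0
    while i < len(pooled):
        # Find the run of tied values that starts at position i.
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        midrank = (i + 1 + j) / 2.0  # average of ranks i+1 .. j
        rank_sum_a += midrank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    n_a, n_b = len(sample_a), len(sample_b)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2.0
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical prehension forces in grams (illustrative values only).
nonamputee_pf = [950, 1010, 1100, 1230]
amputee_pf = [1280, 1340, 1450, 1600]
u_non, u_amp = mann_whitney_u(nonamputee_pf, amputee_pf)
```

The smaller of the two U values is the one ordinarily referred to a table of critical values; with samples this small no distributional assumption is needed, which is the reason for preferring the method when normality and homogeneity of variance cannot be assumed.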


Table 2


Mean Times per Transport and Variabilities in Manipulation Task


                                         Nonamputees              Amputees
                                       T/tr       T/tr         T/tr       T/tr
Treatments                           X̄ (sec.)   s (sec.)     X̄ (sec.)   s (sec.)

Self-paced, regular pattern            .55        .14          1.01       .32
Self-paced, random pattern             .70        .21          1.20       .36
Experimenter-paced, regular pattern                             .90       .16
Self-paced, speed trial                                         .78       .23







Table 3


Mean Tracking Performance and Variability of Amputees and Nonamputees


                                              Nonamputees
Criterion Measure                            X̄          s

Total number of hits                       302.75      39.49
Number of hits, first minute                62.30      10.91
Number of hits, fifth minute                62.15       8.38
Time on target, first minute                31.50       3.24
Time on target, fifth minute                28.20       5.51
Time on target per hit, first minute         0.53       0.13
Time on target per hit, fifth minute         0.47       0.14


Results *


Manipulation Task: Comparison of Prehension
Forces (PF)


Comparisons of amputee with nonamputee
performance on this task were made between
all amputees (Groups III, IV, V) and the 10
nonamputee Ss (Group II) performing with
the nonpreferred hand.

Inspection of the mean PF values for four
treatment conditions for these groups showed
consistently higher force levels for the ampu-
tees. However, statistical comparisons of the
median differences by the Mann-Whitney U
test failed to reach significance.

No data could be reported for the experi-
menter-paced random pattern condition since
none of the amputees were able to complete
this task. Complexity of the task led to con-
fusion and performance breakdown in each
case.

Mean values and variabilities are shown in
Table 1.


Manipulation Task: Comparison of Times
Per Transport (T/tr)


Statistical comparison between amputee and
nonamputee performance by the Mann-Whit-
ney U test showed that amputees performed
consistently slower. This difference was sig-
nificant for each of the four conditions. The
mean values and variabilities are reported in
Table 2.


* The statistical tables have been deposited with the
American Documentation Institute. Order Document
No. 5690 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress, Wash-
ington 25, D. C., remitting in advance $1.25 for
microfilm or $1.25 for photocopies. Make checks
payable to Chief, Photoduplication Service, Library
of Congress.


Manipulation Task: Performance of Amputee 
Subgroups 


The small size of the amputee subsamples 
did not permit any statistical comparisons of 
performance. However, inspection of the raw 
data indicated a lack of performance differ- 
ences due to amputation level or method of 
muscle-prosthesis coupling. 


Tracking Task: Comparison of Target Hits 


Tolerance limits for “on target” were set as 
a 6 mm. wide band on the scope face. Visu- 
ally this meant for the S that he was on tar- 
get whenever the perimeter of the tracking 
dot touched the target line. 

Comparisons of amputee with nonamputee 
performance have been made between all am- 
putees (Groups III, IV, V) and all nonampu- 
tees (Groups I, II). Combining Groups I 
and II seemed justified since none of the Ss 
had any previous experience with a similar 
task. 

Statistical comparisons were made for three 
time intervals: (a) total number of hits dur-
ing the five-minute trial; (b) number of hits
during the first minute; (c) number of hits 
during the fifth minute. None of these com- 
parisons reached statistical significance. 


Tracking Task: Comparison of Time on 
Target 


Comparisons have been made for the first 
and fifth minutes of performance. None of 
the differences reached statistical significance. 







Tracking Task: Comparison of Time on 
Target Per Hit 


Comparisons for first and fifth minute of 
performance again failed to differentiate be- 
tween amputees and nonamputees. 

Table 3 summarizes the mean values and 
variabilities for the tracking task. 


Tracking Task: Performance of Amputee
Subgroups 


No statistical comparisons were possible be- 
cause of the small size of the subsamples. In- 
spection of the raw data again failed to indi- 
cate pronounced performance differences at- 
tributable to amputation level or method of 
coupling. However, one of the Ss with cine- 
plastic prosthesis control could not complete 
the task because of a tremor in the biceps. 


Discussion 


To the authors’ knowledge, no systematic 
investigation of the role of cutaneous and 
kinesthetic information during active motor 
performance with artificial arms has been 
conducted previously. The present study also 
falls far short of the goal of assessing the im- 
portance of the various available cues in psy- 
chomotor skills. For example, it would have 
been desirable to study performance changes 
during selective anesthetization of tactile and 
proprioceptive afferents without also affecting 
the motor efferents. The authors made some 
preliminary attempts to do this and failed, 
for present methods of anesthesia do not 
seem to permit adequate separation of sen- 
sory components. 

The problem, therefore, has been approached 
on a less refined level of analysis and the re- 
sults provide at best an initial evaluation of 
the practical utility of the composite of all 
residual cues. 

The principal findings of this experiment 
can be summarized as follows: 

1. Amputees, regardless of amputation level 
and type of prosthesis control, made adequate 
use of the available cues. This was evidenced 
by prehension force adjustments when ma- 
nipulating an object of unknown weight as 
well as by the tracking scores which also were 
dependent on the accuracy of pressure dis- 
criminations. 




2. Amputees showed a pronounced impair- 
ment in response speed when the task in- 
volved gross motions as was shown on the 
manipulation task. Response speed on the 
tracking task, which did not involve gross 
movements with the prosthesis, seemed to be 
comparable to normal hand performance.

3. Task complexity requiring additional 
“central integration time” as accomplished by 
imposition of a random pattern upon the ex-
perimenter-paced manipulation task led to
breakdown of performance for all amputees. 

From these results we inferred that ampu- 
tees apparently have fairly adequate sensory 
control for execution of simple visuo-motor 
tasks. However, visuo-motor tasks requiring 
gross motions show impairment of amputee 
performance due to an increase in time needed 
for completion of a given movement with the 
prosthesis. This was brought out when speed 
stress was increased by changing the task com- 
plexity. 

The present investigation does not permit 
us to estimate the relative importance of vi- 
sion to the reliable interpretation of the cu- 
taneous and proprioceptive cues perceived dur- 
ing performance. It seems to be fairly well 
established that vision alone in the absence 
of other sensory information cannot serve as 
an adequate source of information for motor 
performance (6,7). This would be expected 
to be particularly the case where the task 
requires accurate prehension force control as 
was true in our tracking task. On the other 
hand it has been found that blind-folded per- 
formance will cause a decrement in manipula- 
tion as well as in travel time, and even after 
prolonged practice these components of per- 
formance will not reach the level of proficiency 
attainable with unrestricted vision (5). An 
experiment in this laboratory which compared 
a prosthesis of the type used here with a 
prosthesis in which prehension force was elec- 
trically actuated by an on-off switch showed 
an increase in reaction time only for the elec- 
tric arm when visual cues were removed (1). 
Because of the type of actuation fewer non- 
visual cues were available for the electric arm. 

The present results as well as some of our 
earlier ones (1, 4) seem to indicate clearly 
that amputees are capable of using nonvisual 







as well as visual cues effectively with present 
prosthesis designs. Accordingly, we feel that 
no specific sensory aids should be recom- 
mended for prosthesis design until both the 
adequacy and reliability of nonvisual cues 
have been studied for tasks in which the 
obstruction of vision has been investigated 
further. 


Summary 


This investigation was designed to evaluate 
three possible sources of performance decre- 
ments of amputees: (a) Inadequate sensory 
information during prosthesis performance. 
(b) Increase in duration of the mechanical
motions in comparison to normal movements. 
(c) Increase of the “central integration time” 
during a task due to the complexity added by 
the prosthesis. Identification of the principal 
source of decrement would serve as an indi- 
cant for the site of modification of the engi- 
neering design of the prosthesis. This modi- 
fication could take the form of sensory aiding 
devices, improvement of the mechanical func- 
tion, or simplification of the coupling to the 
control musculature. 

Two visual motor tasks were chosen to pro- 
vide information about these possible sources 
of performance deterioration: (a) manipulation
task; (b) pursuit tracking task. Twenty
nonamputees and 15 unilateral amputees 
served as experimental Ss. The amputees 
consisted of eight below-elbow amputees with 
harness control, four below-elbow amputees 
with cineplastic control, and three above-
elbow amputees with harness control.

The principal findings were: (a) Perform-
ance showed that all amputees received ade-
quate pressure cues; (b) all amputees showed




a pronounced impairment in response speed 
when the task required gross movements with 
the prosthesis; (c) task complexity inducing 
time stress by requiring additional “central 
integration time” led to performance break- 
down and confusion. 

Since the study did not permit an estima- 
tion of the adequacy and reliability of cu- 
taneous and proprioceptive cues in the ab- 
sence of vision, another investigation in which 
vision is obstructed has been suggested. 


Received January 2, 1958. 


References 


1. Gottlieb, M., Santschi, W., & Lyman, J. Evalua-
tion of the Model IV-E2 electric arm. De-
partment of Engineering, Univer. California,
Los Angeles, 1953. (Rep. 53-23.)

2. Groth, Hilde. An experimental assessment of pre-
hension force as a measure of effort in psycho-
motor skills. Unpublished doctoral disserta-
tion, Univer. California, Los Angeles, 1957.

3. Groth, Hilde, & Lyman, J. Relation of the mode
of prosthesis control to psychomotor perform-
ance of arm amputees. J. appl. Psychol.,
1957, 41, 73-78.

4. Groth, Hilde, & Lyman, J. A comparison of two
modes of prosthetic prehension force control
by arm amputees. J. appl. Psychol., 1957, 41,
325-328.

5. Huiskamp, J., Smader, R. C., & Smith, K. N. Di-
mensional analysis of motion: IX. Compari-
son of visual and nonvisual control of com-
ponent movements. J. appl. Psychol., 1956,
40, 181-186.

6. Lassek, A. M. Effect of combined afferent lesions
on motor function. Neurology, 1955, 5, 269-
272.

7. Twitchell, Th. E. Sensory factors in purposive
movement. J. Neurophysiol., 1954, 17, 239-
252.

8. Walker, H. M., & Lev, J. Statistical inference.
New York: Holt, 1953.





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Curriculum Assessment with Critical Incidents 


Albert S. Glickman 


U. S. Naval Personnel Research Field Activity, Washington ¹


and T. R. Vallance 


Human Resources Research Office 


By and large, even in industry, adequacy 
of training programs has been assessed by 
subjective judgment rather than by research 
methods (5). As Flanagan (1, 2) has re- 
marked, there is need to supplement the usual 
collection in conference of “leaders” or “ex- 
perts” by systematic collection and analysis 
of factual data. 

This investigation explores the use of “criti- 
cal incidents” (1, 2, 3) for assessing the rele- 
vance of various aspects of a training pro- 
gram to the job performance requirements of 
its graduates. Specifically, it involves the 
relevance of subject matter taught in the 
Navy’s Officer Candidate School (OCS) to the 
duty requirements of new officers aboard de- 
stroyer-type ships. 

The 17-week OCS training aims to prepare 
the new ensign to profit from experience and 
rapidly achieve a satisfactory level of job per- 
formance. It is not intended to produce spe- 
cialists. Rather, the purpose is to provide 
junior officers with the familiarity in subject 
matter areas containing technical problems 
needed to develop an understanding of a va- 
riety of technical functions, and to develop 
ability to supervise technicians. 

The critical incident procedure, as adapted 
here, takes cognizance of the supervisory na- 
ture of the junior officer's duties and at the 
same time permits the use of duty perform- 
ance behaviors of sufficient specificity to in- 
dicate OCS curriculum elements most and 
least relevant to significant shipboard duties. 
Also, by concentrating upon observed actual 


1 This research was conducted while the authors 
were with the Officer Personnel Research Program 
of the American Institute for Research, working un- 
der contract Nonr 890(01) with the Office of Naval 
Research. A more extensive report was prepared at 
that time (4). The viewpoints expressed herein are 
not to be construed as those of the U. S. Navy or 
U. S. Army. 


performances on duty, the procedure makes it 
possible to identify duty areas which are not 
covered by instruction, thus contributing to a 
further definition of what the objectives and 
content of training should be. 

The essential questions to which this re- 
search is addressed are these: “What are the 
things which happen frequently to ensigns, or 
which their superiors think they ought to be 
able to handle soon after reporting aboard?” 
and then, “Are these the things to which at- 
tention and emphasis are also given by OCS 
instruction?” 

Our legitimate concern is exclusively with 
the performance requirements (and relevant 
training) reflected in the content of incidents, 
whether they be found in reports of “effective” 
or “ineffective” incidents. This research does 
not tell us how much OCS training contributes 
to causing effective or ineffective incidents. 


Determining the Gross Relevance of OCS 
Curriculum to Duty Incidents 


Basic Data 


An earlier study (6) had accumulated in- 
cidents involving destroyer officers from all 
training sources. Of those, 1,073 incidents
reported on general line ensigns constituted 
our pool of basic data. This sample of in- 
cidents is used here to provide a description 
and operational definition of the critically sig- 
nificant elements of the duties of new officers 
aboard destroyers. Incidents were typed on 
IBM cards to facilitate handling and analysis. 


Analysis of the Incident Pool with Reference 
to Curriculum Areas 


The incident cards were submitted to sub- 
ject matter specialists, OCS officer-instructors, 
who carried out a review sequence that ul- 
timately required that each incident be identi- 




fied with a lesson in the lesson-plan of one of 
the seven sections of the curriculum: 1. Sea- 
manship, 2. Orientation and Military Justice, 
3. Navigation, 4. Operations, 5. Naval Weapons 
and Fire Control, 6. Engineering and Damage 
Control, and 7. Military. Category 8, labelled 
“Other,” was established for those incidents 
which reflected motivation and interest in the 
job rather than the possession of specific skills 
or knowledges. 

The instructional set for sorting effective 
incidents concerned the curricular locus of 
skills, knowledges and attitudes shown in each 
incident. For the ineffective incidents the 
emphasis was on the skills, knowledges and 
attitudes which, if learned, could have pre- 
vented the ensign from getting involved in the 
incident. 

Within each subject matter category, in- 
cidents were classified in two ways: “Taught” 
—the relevant skills, knowledges and attitudes 
are currently the subject of specific lessons of 
instruction; and “Not Taught”—pertains to 
material not covered, but which logically be- 
longs in one of the school sections and prob- 
ably would be taught were time and facilities 
available. All “Other” incidents were, of
course, of the “not taught” variety.

No assumption can be made that individual
incidents, or even incidents of a given sort,
are equally significant from the operational
standpoint. Yet, if a large number of the
critical incidents reported fall into a given sub- 
ject matter area, it is difficult to escape the 
conclusion that there is much important ac- 
tivity aboard ship to which that area of in- 
struction is relevant. Likewise, if the number 
of incidents in certain instructional areas is 
small, unless it is found that the incidents are 
individually of vital significance, it would 
seem that the learnings in such areas are not 
heavily drawn upon in the performance of a 
new officer’s duties. 

It appears from Table 1 that it is from his
background in the courses Orientation and 
Military Justice, Seamanship, and Operations, 
that on reporting to a destroyer the new 
ensign might expect to draw most frequently, 
while being less likely to be called upon to 
make use of what he learned in Navigation, 




Engineering and Damage Control, and Naval 
Weapons and Fire Control.²

Can the fact that 62.9% of the classifiable 
incidents are said by instructor-judges to be 
taught at OCS be considered to reflect a 
“good” state of affairs? Since the limits im- 
posed by time and facilities have not been 
considered and since there is no standard of 
what “percentage taught” can be considered 
good, this question cannot be answered in an 
absolute sense. However, we might say that 
the higher the “percentage taught” the better. 
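
The “percentage taught” figure is a simple ratio of taught incidents to all classified incidents. As a minimal sketch, with a hypothetical function name and using only the counts printed in the “Total” rows of Table 1, the arithmetic could be written as:

```python
def percent_taught(taught, not_taught):
    """Percentage of classified incidents judged to be covered by instruction."""
    return round(100.0 * taught / (taught + not_taught), 1)


# Counts taken from the "Total" rows of Table 1.
summary = percent_taught(664, 392)    # all classifiable incidents
effective = percent_taught(224, 123)  # effective incidents only
```

The same ratio can be formed within any single curriculum section, which is how the “% Taught” rows of Table 1 are defined.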

Thus, it might be presumed that of the six 
academic courses, Seamanship and Orientation 
and Military Justice most completely cover 
their respective areas, and that the Naval 
Weapons course least adequately treats of the 
kinds of situations that destroyer officers re- 
port as critical. However, there are instances 
where 50 or more incidents are considered 
relevant to one hour of instruction, leading one 
to discount the direct interpretation of “per-
centage taught” as a sufficient criterion of the
adequacy of subject matter coverage. 

References to the number of incidents as- 
signed to specific hours of instruction, along 
with further examination of the contents of 
sessions by subject matter experts, would seem 
to provide a more meaningful employment of 
data such as these for purposes of curriculum 
analysis. 


Developing an Index of Incident-Behavior 
“Importance for Early Usefulness” 


Rationale 


The main objective of the next phase was 
to obtain expert judgments regarding the sig- 


2 Some incidents were considered relevant to more 
than one lesson. However, in Table 1, no incident 
was counted more than once, since when an incident 
was identified with more than one lesson, these les- 
sons were always in the same curriculum section. 
The judges could not classify 17 of the original 1073 
incidents. 

3 If the “percentage taught” figures for effective
and ineffective incidents for the six academic courses 
(all except Military) are arranged in rank order, the 
correlation between ranks for “effectives” and “in- 
effectives” is .79, suggesting that, whether relevant 
effective or ineffective incidents are used, the relative 
coverage by courses is much the same. Table 1 also 
reveals that overall the “percentage taught” is not 
much different for effective and ineffective incidents 
(64.6% vs. 62.1%). 
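
A rank-order comparison of this kind can be sketched as below. The footnote does not name the coefficient used, so the rank-difference (Spearman) formula is an assumption here, as are the function names; the six pairs of percentages are hypothetical values chosen for illustration and are not the figures from Table 1.

```python
def ranks(values):
    """Rank values from highest (1) to lowest; this sketch assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0] * len(values)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result


def rank_correlation(xs, ys):
    """Rank-difference correlation: 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(xs)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1.0 - 6.0 * d_squared / (n * (n * n - 1))


# Hypothetical "percentage taught" figures for six academic courses.
effective_pct = [80.0, 75.0, 60.0, 55.0, 40.0, 30.0]
ineffective_pct = [78.0, 81.0, 50.0, 58.0, 35.0, 20.0]
rho = rank_correlation(effective_pct, ineffective_pct)
```

With only six courses the coefficient can take few distinct values, so a figure such as .79 is better read as a rough indication of agreement than as a precise estimate.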







Table 1 


Results of Sorting Critical Incidents by Subject Matter Area ᶜ


                                       Effective        Ineffective         Summary
                                       N      % ᵃ       N      % ᵃ        N      % ᵃ

1. Seamanship
   Taught                             52      4.8              9.9               14.7
   Not-Taught                                 1.1              3.7                4.8
   Total                                                                 210     19.6
   % Taught ᵇ

2. Orientation and Military Justice
   Taught
   Not-Taught
   Total
   % Taught ᵇ

3. Navigation
   Taught
   Not-Taught
   Total
   % Taught ᵇ

4. Operations
   Taught
   Not-Taught
   Total
   % Taught ᵇ

5. Weapons
   Taught
   Not-Taught
   Total
   % Taught ᵇ

6. Engineering
   Taught
   Not-Taught
   Total
   % Taught ᵇ

7. Military
   Taught
   Not-Taught
   Total
   % Taught ᵇ

8. Other
   Taught                              0                0
   Not-Taught                         26
   Total                              26                                  63
   % Taught ᵇ                        0.0                                 0.0

Total
   Taught                            224                       41.0      664
   Not-Taught                        123     11.5      269     25.1      392
   Total                             347     32.3      709     66.1     1056
   % Taught ᵇ                       64.6              62.1              62.9


ᵃ Percentages in these columns are based on the original total number; N/1073 rounded to nearest tenth.

ᵇ Percentages in these rows represent the number of “taught” incidents in relation to the total number of incidents within
respective subject categories (i.e., “taught”/“total”).

ᶜ Seventeen incidents (1.6%) were incapable of being judged with confidence and are not included in this summary.










nificant aspects of duty for which training 
should prepare the new ensign. 

To place the notion of importance clearly in 
the framework of training, it was reasoned 
that to the extent that preduty training pro- 
vides the trainee with skills, knowledge, or 
attitudes required of him in the first few 
months of duty, or a sound background on 
which experience will develop them, that train- 
ing is successful. To the extent that it con- 
centrates on that which the trainee will not be 
called upon to use until after he has had a 
year or more of duty, or has been sent to 
other schools to acquire or reacquire them, 
initial training is being misdirected. From this 
standpoint, the immediacy of the opportunity 
or requirement to handle any given situation 
would be an index of its “importance.” It is 
in this sense that “importance,” standing for 
“importance for early usefulness,” is used in 
this article. 


Procedure 


A questionnaire, the Junior Officer Training Re- 
quirements Checklist (JOTRC), was prepared in 10 
forms containing an aggregate of 985 items.⁴ The
final format evolved from four preliminary try-out 
versions. 

Each form listed approximately 100 incidents, and 
was divided into two parts. Part I contained ef- 
fective incidents and Part II ineffective incidents for 
Forms 1, 3, 5, 7, and 9. For Forms 2, 4, 6, 8, and 
10, the order was reversed. About one hour was 
needed to fill out a form. The incidents related to 
all OCS instructional areas, and included those judged 
earlier to be among the “not taught” as well as those 
included in the curriculum. 

The specific questions were: 

1. For the situations involving effective incidents: 
“How soon after reporting aboard would you expect 
the new officer to be able to do this?” 

2. For the situations involving ineffective incidents: 
“How soon after reporting aboard would you expect 
the new officer to be able to handle this situation 
satisfactorily?”


The directions specified the writing of the num- 
ber of months (directly after being commissioned), 
ranging from 0 (for judgments that the reserve officer 
should be able to perform well immediately on re- 
porting aboard) to 12. “X” was to be used if the 


4 This number is smaller than that involved in the
two instructor sorts because of deletion of incidents:
(a) of clear duplication; (b) which might be mis-
interpreted by the judges because the situations rep-
resented contraventions of Navy policy (e.g., a num-
ber of incidents originally submitted as “effectives”
turned out to be violations of regulations).




respondent would not expect the new officer to be 
able to handle the given type of situation satisfac- 
torily until after he had been aboard for more than 
a year. “N” was to be used if the reporter felt un-
able to make a judgment because of wording or in-
terpretation of the incident items.⁵

Two copies of the checklist were sent to each of 
170 destroyer-type ships, one for the commanding 
officer (CO) and one for his executive officer (XO). 
Each of the 10 forms was sent to 30 to 50 officers. 
A total of 301 JOTRCs were returned in time to be 
analyzed. 

For each of the 985 incidents, the estimates ob- 
tained from COs and XOs were tallied and a median 
value was found. We call these medians TESP 
(“time expectancy for satisfactory performance”)
values. 
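
The computation of a TESP value for one incident can be sketched as follows, under the coding rules given in the directions and in the note on computational work: months 0 through 12 are taken as written, “X” is scored as 13, and “N” responses are left out. The function name and the sample responses are hypothetical.

```python
from statistics import median


def tesp(responses):
    """Median time expectancy (months) for one incident.

    Each response is a number of months 0-12, "X" (more than a year,
    scored as 13), or "N" (no judgment possible, excluded).
    """
    months = []
    for response in responses:
        if response == "N":
            continue
        months.append(13 if response == "X" else int(response))
    return median(months)


# Hypothetical CO and XO estimates for a single incident.
example = tesp([0, 2, "X", "N", 4])
```

Using the median rather than the mean keeps the arbitrary score of 13 for “X” from pulling the value upward when only a few respondents expect a delay of more than a year.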


Analyses of Results 
Reliability of TESP Values 


Using Form 8 of the JOTRC, a reliability 
estimate for the CO-XO TESPs was obtained. 
The 44 questionnaires returned (out of 50) 
were divided in two groups of 22 each. For 
each item two TESPs were obtained.⁶ Correlation
of the two sets of TESPs over the 95
items gave a coefficient of .93.
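
The split-group procedure can be sketched as below: a median is taken for every item within each half of the returned questionnaires, and the two sets of medians are correlated over items. The text does not state which coefficient was computed, so the product-moment formula used here is an assumption, as are the function names and the small data set.

```python
from statistics import median


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def split_group_reliability(group_a, group_b):
    """Correlate item medians from two groups of questionnaires.

    Each questionnaire is a list of month estimates, one per item,
    with items in the same order in both groups.
    """
    n_items = len(group_a[0])
    medians_a = [median(q[i] for q in group_a) for i in range(n_items)]
    medians_b = [median(q[i] for q in group_b) for i in range(n_items)]
    return pearson_r(medians_a, medians_b)


# Hypothetical estimates: three items, two small groups of respondents.
group_a = [[0, 3, 6], [1, 4, 7], [2, 5, 8]]
group_b = [[2, 6, 10], [2, 6, 10]]
r = split_group_reliability(group_a, group_b)
```

In the study each group held 22 questionnaires and the medians were compared over 95 items; the sketch only shows the order of operations.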


Distribution of TESP Values 


The distribution of TESPs for the 985 
JOTRC items was divided into incidents of 
“effective” (E) and “ineffective” (I) per- 
formance. 

There was a distinct difference in TESPs as- 
signed to E and I performances (median values 
of 6.0 and 2.5 months, respectively). The of- 
ficers responsible for operating these ships 
were most immediately concerned with keep- 
ing the novice (and themselves) out of trouble, 
and were content to wait longer for a quality 
of performance exceeding what is normally 
considered satisfactory. Overall the TESPs 
tended to pile up at the bottom (more “im- 
portant”) end of the scale (Table 2). 

These estimates seem to bear out assertions 
that destroyer duty requires of an officer 
rapid learning and assumption of responsi- 

5 The results to be presented must be interpreted 
in terms of the status quo at the time of this investi- 
gation. To the extent that the operational, training, 
or personnel factors in the Navy change conditions, 
the values obtained might be altered. 

6 In all computational work the “X” response,
meaning “more than 12 months,” was assigned a
value of 13; the lower the value, the greater the
relevance of the incident to the OCS curriculum.







Table 2


Frequency Distribution of Incidents by Curriculum Area, Subcategorized as “Taught” and “Not Taught,”
With Mean of TESP Values for the Incidents in Each Subcategory Indicated


                                Taught                   Not Taught                  Total
                        No. of   % of    Mean     No. of   % of    Mean     No. of   % of    Mean
Curriculum Area         Inci-    Inci-   TESP     Inci-    Inci-   TESP     Inci-    Inci-   TESP
                        dents    dents ᵃ          dents    dents ᵃ          dents    dents ᵃ

1. Seamanship                    16.3    4.34       46                       189     21.5    4.32
2. Orientation and
   Military Justice              34.2    2.73       99                       399     45.4    2.78
3. Navigation                     0.9    5.13        1                         9      1.0    4.78
4. Operations                     8.8    4.48       67                       144     16.4    4.72
5. Naval Weapons                  1.4    7.30       38                        50      5.7    6.51
6. Engineering and
   Damage Control                 3.0    5.46       18                                5.0    5.48
7. Military                       0.2                2                                0.5    4.00
8. Other                           -       -        39                                4.4    4.32

Summary                  568     64.7    3.62      310                              100.0    3.87


ᵃ Percentages based on 878 incidents; N/878. Eliminated, because more than 10% of respondents omitted an answer or
indicated that they could not clearly interpret the incident, were 91 of the original 985 items. Not included because they could
not be judged with confidence as to area relevance were 16 incidents.


bility involving manifold skills and knowl-
edges, even allowing for the fact that all
ensigns are not expected to become competent
in all of the 985 incident situations.⁷ The fact
that the COs and XOs expected the ensigns to
be able to handle a sizable majority of the
situations after only a few months on board
seems to lend corroboration to the notion that
the over-all adequacy of the curriculum is, in
general, proportionate to the coverage given
to these situations, even allowing for differ-
ences in TESPs of different incidents. In
other words, if the relevant skills, knowledges,
and attitudes are “taught,” the curriculum
conforms to CO-XO judgments. If these are
not taught, COs and XOs say, in effect, they
ought to be.


TESP Values of Incidents Assigned to Subject
Matter Areas


The next analyses deal with the relative
“importance” of incidents assigned to differ-
ent subject matter areas and the relation be-
tween the number of incidents found to be
relevant to a given area and the TESP values
assigned to those incidents.


7 As indicated in the questionnaire instructions,
the respondent is required to “Suppose [with respect
to each incident] that his [the ensign’s] responsibili-
ties as an officer will give him the opportunity to
handle the situation as shown.”


In order to provide a basis of interpretation 
least vitiated by ambiguity, the following 
analyses were restricted to 878 of the original 
985 incidents included in the 10 JOTRC 
forms. Those 91 items, which more than 10% 
of respondents did not answer or indicated 
that they could not clearly interpret, were de- 
leted. Sixteen items, which the OCS experts 
had not been able to assign with confidence 
to relevant subject matter areas, were also 
removed. The numbers of incidents “taught” 
and “not taught” in the six academic courses 
and in military training were counted, and 
the mean TESP was calculated for each of 
these categories and subcategories, as well as 
the “Others,” as shown in Table 2. Among 
the academic courses there is a close cor- 
respondence between the number of incidents 
said to be relevant to a given course by OCS 
instructors and the mean TESPs derived in- 
dependently from the judgments of “impor- 
tance for early usefulness” by the COs and 
XOs. The three academic courses which em- 
brace the largest number of incidents are also, 
in the same order, the courses which are the 
most “important” as indicated by the mean 










TESP values. Of the six, only the courses in 
Navigation and Naval Weapons, the former 
with a small number of cases, are out of order 
with respect to these two quantities. Again 
it would appear that there is substantial paral- 
lelism between number and TESP value of 
incidents when sorted into OCS curriculum 
areas. 

Some indication of the curricular validity of 
OCS academic offerings is shown by the fact 
that the mean TESP for material “taught” is 
lower (more important) than for material “not
taught.” However the caution introduced 
earlier, because of the large proportion of in- 
cidents said to be related to the material 
taught in a relativelv small number of sessions, 
still applies. 

We have gone one step further and pursued 
the analysis of the 568 incidents said to be 
“taught” in the sample of 878 incidents. We 
determined the mean TESP for the incidents 
assigned to each instructional session to which 
one or more incidents were judged to be rele- 
vant. A distribution of these 157 sessions 
against mean TESPs of incidents assigned to 
each session made it apparent that the lessons 
covering the largest number of relevant in- 
cidents also tend to be considered more “im- 
portant for early usefulness” aboard ship. 
Using 28 lessons for which somewhat more 
reliable mean TESPs are available (based on 
10 or more incidents) we again established a 
fair correspondence between “number” and 
“importance” of incidents in a given category. 
The correlation for these 28 lessons was .65.³
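
The relation between “number” and “importance” can be sketched as follows. This is a minimal illustration only: it assumes a product-moment coefficient, which the text does not name, and the lesson values are invented for the example and are not the study's data. The sign is reversed at the end so that a positive value means that lessons covering more incidents are also the more “important” (lower TESP) ones.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Hypothetical lessons: (number of relevant incidents, mean TESP in months).
lessons = [(34, 1.2), (21, 1.9), (15, 2.6), (12, 2.2), (10, 3.4)]
counts = [n for n, _ in lessons]
mean_tesps = [t for _, t in lessons]

raw_r = pearson_r(counts, mean_tesps)  # negative: more incidents, lower TESP
reported_r = -raw_r                    # sign reversed for direct reading
print(round(raw_r, 2), round(reported_r, 2))
```

With real data, `lessons` would hold one pair for each of the 28 lessons having 10 or more incidents.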

An inspection was made of the content of 
these 28 lessons. It supports the view that of 
all his background, his skills in human rela- 
tions will be those which a new officer will be 
most immediately and most frequently called 
upon to demonstrate. It appears that the 
first thing a junior officer has to learn is how 
to manage his relations with contemporaries 
and seniors. Leadership and management of 
subordinate personnel show up as next most 
“important” skills. Demands upon his tech- 
nical skills and knowledges are not made until 
later. 


3 The sign of the coefficient has been reversed here,
to account for the fact that the more “important” 
items have lower TESPs, and thus permit direct read- 
ing and interpretation. 




Discussion 


It has been pointed out that the general 
purpose of OCS training is to develop officers 
with general familiarity in subject matter 
areas containing technical problems and with 
ability to supervise technicians, not technical 
specialists. It would appear that officers re- 
sponsible for operating destroyers are in gen- 
eral agreement with that emphasis. They are 
perhaps inclined to give greater emphasis to 
the requirement that he be able to take up 
quickly, and carry on satisfactorily, responsi- 
bilities requiring “know-how” in dealing with 
people rather than technical skill. Critical 
incidents involving such skills are also, by 
far, the kind most frequently reported about 
ensigns. 

Other uses of the results are available. For 
example, we have concentrated comment on 
the curriculum areas or lessons most fre- 
quently the subject matter of relevant inci- 
dents and to which most “importance” is at- 
tributed. We have not spoken much of the 
lessons involving relevance with none of our 
pool of incidents or those of low “importance.” 
Obviously, these would be examined first in 
terms of their contribution to the training 
program and for the possibility of using the 
time given them for instruction on some of 
the incident-behaviors which were highly con-
centrated in a few sessions of the program
studied (e.g., leadership). 

The scope of this research has been limited. 
Although the feasibility of the method has 
been demonstrated by embracing the whole 
curriculum as the field of inquiry, insufficient 
data could be brought to bear on many areas. 
More concentrated attacks can be made upon 
specific curriculum problems, using the same 
procedures. For instance, if it were desired 
to revamp the Navigation offerings, critical 
incidents involving navigation could be re- 
quested from officers on ships and then sorted 
by lessons of relevance. Since number and 
“importance” of incidents tend to be associ- 
ated, it might be possible to eliminate the de- 
termination of TESPs, if one were willing to 
sacrifice some precision. 

Naturally, if any considerable changes in 
the curriculum were to be made, follow-up 
studies should be conducted to see if the de- 







sired changes in on-the-job performance came 
about. 

In this exploratory research we have not 
concerned ourselves with such matters as the 
policies, directives, costs and laws governing 
operation of a school, nor with the time, staff, 
techniques, or facilities, necessary or available 
to achieve satisfactory levels of competence 
through training in the various aspects of a 
curriculum. However, they are within the 
purview of the administrative authorities gov- 
erning a training program. 

The case in point here has been the Navy’s 
Officer Candidate School. However, the ap- 


plicability of the methodology to other train- 
ing and educational settings is apparent. 


Summary 


This research sought to identify those as- 
pects of the Navy OCS curriculum which 
were most and least relevant to duties of 
newly commissioned ensigns aboard destroy- 
ers; thus to provide information useful in im- 
provement of training. 

More than 1,000 critical incidents of ef-
fective and ineffective performance by de-
stroyer ensigns were available from earlier
research. These were sorted according to the 
OCS curriculum area to which each was most 
relevant. The “relevant” incidents assigned
to each area were then classified as: (a)
“Taught,” currently the subject of specific
lessons of OCS instruction, and (b) “Not
Taught,” pertaining to a subject matter area in
the curriculum, but not covered due to time or
facility limitations.

A Junior Officer Training Requirements 
Checklist was sent to 340 commanding and 
executive officers of destroyer-type vessels, 
and was completed by more than 300 of them. 
The checklist was prepared in 10 forms, each 
containing approximately 100 incidents. Each 
form was sent to 30 to 50 officers, with in- 
structions to make a judgment for each in- 


cident as to: “How soon [in number of 
months] after his reporting aboard [directly 
after being commissioned], under normal con-
ditions, would you expect the new [reserve]
officer to be able to handle the situation to 
your satisfaction?” From their answers an 
index of “time expectancy for satisfactory 
performance” was determined for each in- 
cident with high reliability. It was assumed 
that the sooner the ensign is expected to 
handle a situation satisfactorily, the more “im- 
portant” it is that the relevant material be 
learned at OCS. 

The findings indicate that the new ensign 
most frequently and most immediately will be 
called upon to draw on background relevant 
to human relations, leadership, and personnel 
administration skills; technical skills are ex- 
pected to be developed later. 

Suggestions are made for utilization of the 
current findings and directions for further re- 
search. 


Received January 9, 1958. 


References 


1. Flanagan, J. C. Research techniques for developing
educational objectives. Educ. Rec., 1947, 28, 139-148.

2. Flanagan, J. C. The critical requirements approach
to educational objectives. Sch. & Soc., 1950, 71,
321-324.

3. Flanagan, J. C. The critical incident technique.
Psych. Bull., 1954, 51, 327-358.

4. Glickman, A. S., & Vallance, T. R. An exploratory
study of the applicability of critical incident
techniques to the assessment of curricula for
officer candidate training. USN Bur. Naval
Pers. Res. Tech. Bull., 1954, No. 54-23.

5. Mahler, W. R., & Monroe, W. H. How industry
determines the need for and effectiveness of
training. USA Personn. Res. Sect. Rept., 1952,
No. 929.

6. Vallance, T. R., Glickman, A. S., & Vasilas, J. N.
Critical incidents in junior officer duties aboard
destroyer-type vessels. USN Bur. Naval Pers.
Res. Tech. Bull., 1954, No. 54-4.








Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


A Personality Inventory Employing Occupational Titles ¹


John L. Holland 
National Merit Scholarship Corporation 


In the last two decades the research litera- 
ture concerning the meaning of occupational 
choice—the relation of vocational interests to 
attitudes and personality—has implied with 
increasing clarity that a useful personality in- 
ventory might be constructed from occupa- 
tional or interest test content (3, 4, 12, 15). 
The present experimental inventory represents 
a further testing and application of this hy- 
pothesis. 

Essentially, the Holland Vocational Prefer- 
ence Inventory is a personality inventory 
which uses occupational titles for content. It 
is constructed to give a maximum amount of 
reliable and valid information with a minimum 
amount of testing and scoring time, skill, and 
expense. Many variables usually included in 
interest and personality tests have been in- 
tegrated in the HVPI. These variables are 
assumed to yield a broad range of information 
concerning the S’s personal adjustment, values, 
attitudes, and vocational motivation. The in- 
ventory is self-administering. The Ss record 
their “feelings and attitudes” about occupa-
tional titles by indicating either “interest”
(appeal) or “dislike” for each of the 300
items. 

Because of the relative neutrality of its 
title and occupational content, the inventory
appears to have a number of uniquely de-
sirable qualities. Occupational titles provide
a subtle set of stimuli which minimize the 
negative reactions sometimes provoked by 
more obvious personality inventories, reduce 
the S’s need to “fake” since the inventory is 
perceived as a “vocational” inventory, and 
virtually eliminate those requests for per- 
sonality interpretations which usually follow 
on the heels of the administration of other 
personality inventories. 


1 I wish to acknowledge the support and encourage-
ment of numerous colleagues and students at Western
Reserve University, VA Hospital, Perry Point, Mary- 
land, and the University of Maryland, who have 
made this study possible. 




Rationale 


The development of the inventory has been 
guided by a psychological rationale integrated 
from a number of divergent fields: psychology, 
psychiatry, test theory, and sociology. The 
aim of this formulation is to provide a theo- 
retical framework for using and interpreting 
the inventory, and to present a means of 
extending and clarifying its construct validity. 
The following assumptions summarize this 
tentative rationale: 

1. The choice of an occupation is an expres- 
sive act which reflects the person’s motivation, 
knowledge, personality, and ability. Occupa- 
tions represent a way of life, an environ- 
ment rather than a set of isolated work func- 
tions or skills. To work as a carpenter means 
not only to use tools but also to have a cer- 
tain status, community role, and a special 
pattern of living. In this sense, the choice of 
an occupational title represents several kinds 
of information: the S’s motivation, his knowl- 
edge of the occupation in question, his insight 
and understanding of himself, and his abilities. 
In short, item responses may be thought of as 
limited but useful expressive or projective 
protocols. 

2. The interaction of the person and his 
environment creates a limited number of fa- 
vorite methods for dealing with interpersonal 
and environmental problems. The various 
HVPI scales are assumed to measure some 
of these common or favorite methods of ad- 
justment. Translated into scale terms, peaks 
reveal the person’s favorite methods whereas 
low points indicate the rejected methods of 
adjustment. Or, peaks may represent desir- 
able roles and situations, and low points, 
threatening or distasteful roles and situations. 

The foregoing assumption is predicated on
another assumption; namely, that the various 
classes or occupational groups furnish dif- 
ferent kinds of gratifications or satisfactions 
and require different abilities, identifications, 
values, and attitudes. This particular hy- 







pothesis has extensive empirical support from 
studies relating “vocational interests” to per- 
sonality variables, psychiatric status, values 
and attitudes. Typical studies in this area 
include those of Berdie (1), Darley (2), Forer 
(3), Garman (4), Gough (5), Laurent (7), 
Sternberg (11), Terman (13), and Weir (15). 

3. The development of adequate adjustive 
techniques requires accurate discrimination 
among potential environments. The ability 
to discriminate potentially satisfying and 
beneficial environments from potentially dis- 
satisfying and unhealthy environments is im- 
perative for creative health. In this sense, the 
inventory is a miniature performance test of 
the S’s understanding of his surroundings in 
relation to himself; that is, the choice of an 
occupational title is a measure of the S’s in- 
sight and understanding as well as a sign of 
his motivation and his intellectual comprehen- 
sion of the occupation in question. This third 
assumption has two corollaries which seem 
useful for interpreting “responsiveness” to the 
inventory. 

3a. The total number of preferred occupa- 
tions is a function of a number of personality 
variables. These hypothesized variables are 
supported by some of the evidence for the 
validity of the inventory, and by several scat- 
tered studies reported by Berdie (1), Weir 
(15), and others. 

Specifically, the total number of preferred 
occupations is a function of dependency, ag- 
gressiveness, mood, degree of cultural intro- 
ception, self-control, sociability, and defensive- 
ness. Over-responsiveness suggests a lack of 
adequate discrimination which may be re-
flected in dependence, aggression, euphoria,
over-introception of the culture, impulsivity,
sociability, and frankness. In contrast, under-
responsiveness appears indicative of greater
independence, passivity, depression, rejection 
of the culture, over-control, withdrawal, and 
defensiveness. 

3b. The inability to make discriminations 
among occupations is indicative of conflict 
and disorganized self-understanding. Just as 
the inability to make everyday decisions is a 
result of conflicting motivations, so the in- 
ability to make positive or negative choices of 
occupations (environments) within the in- 




ventory is a sign of conflict. In this sense, 
conflict is defined as divergent, inaccurate, or 
irreconcilable views about one’s abilities, 
needs, and sources of gratification; and is ac- 
companied by the chronic emotional upset 
which results from such conflict. In test 
terms, inability to make choices is reflected in 
the total number of omitted items. 

4. Interest inventories are personality in- 
ventories. Interest and personality inventories 
are identical in principle and provide similar 
information about the person, although their 
content is quite diverse. Both kinds of in- 
ventories reveal how the S perceives himself 
and his milieu. 


Construction of the Inventory 


Preliminary Forms 


The first form of the inventory was estab- 
lished by constructing eight a priori scales: 
Physical Activity, Intellectuality, Responsi- 
bility, Conformity, Verbal Activity, Emotion- 
ality, Reality Orientation, and Acquiescence. 
These scales were devised by reviewing the 
interest and vocational choice literature with 
reference to personality factors, and by using 
this information to create personality formula- 
tions for each scale. With the formulations 
as a guide, the author constructed the scales 
by selecting occupational titles which pre- 
sumably epitomized the scale rationale. For 
example, the Physical Activity scale consists 
of occupational titles which imply motoric 
activities and skills. Typical titles in this 
scale include machinist, North Woods guide, 
forest ranger, electronic technician, etc. Scales 
are scored by counting the number of “pre- 
ferred” occupations in a given scale. 

The second form consists of a revision of 
the first form by means of an internal con- 
sistency analysis of the individual scales and 
of six added scales, forming a 14-variable 
test. The additional scales were Control, Ag- 
gressiveness, Mf, Status, Heterosexuality, and 
Infrequency. Next, all scales were intercor- 
related and a cluster analysis was performed 
in order to eliminate scales and clarify scale 
meanings. This analysis suggested that 10 
scales were sufficiently independent to merit 
retention and further revision. 










Present Form 


The third form was secured by performing 
an internal consistency analysis for those 10 
scales which survived the cluster analysis of 
the second form. This analysis was accomp- 
lished with two samples of 300 male and fe- 
male college freshmen. The method of analy- 
sis was to compare the upper and lower 25 
per cent of each sample on each scale. The
most discriminating items were then selected
for the final scales, using the “Kelly Tech- 
nique” nomographed by Lawshe (8). 

The Masculinity-Femininity, Status, Infre-
quency, and Acquiescence Scales are excep-
tions to the construction method outlined
above. These scales have a direct empirical 
development. The Status Scale was devised 
by reviewing three studies of occupational 
status which used the rank order method (6, 
9, 10). Scale items were derived by making 
all items satisfy the data of all three studies. 
For example, “Physician” occurs as a high 
status choice in each study. The final scale 
was then keyed so that preferences for occu- 
pations of high status and aversions to occu- 
pations of low status result in a high status 
score. 

Construction of the Mf Scale began with a 
comparison of the responses of 170 male and 
female high school students on the second 




form. This scale was then revised by a com- 
parison of two groups of 150 male and female 
college freshmen. 

The Infrequency Scale consists of 25 items 
which are rarely “liked” and 25 items which 
are rarely “disliked.” It was constructed 
from frequency counts of the 300 test items 
for samples of 119 male and 100 female col- 
lege freshmen. The 25 most popular and 25 
least popular items were selected from each 
sample to form scales by reversing the scor- 
ing; that is, popular items are scored “dis- 
like” and unpopular items are scored “like.” 
Accordingly, high scores represent unusual 
or unpopular choices, whereas low scores 
represent popular choices. 
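
The construction of the Infrequency Scale can be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the response labels "like" and "dislike," and the six-item frequency counts are hypothetical, and the example selects two items of each kind rather than the 25 used in the inventory.

```python
def build_infrequency_key(like_counts, n_each=25):
    """Return {item: keyed response} for the least and most popular items.

    like_counts maps item number -> number of Ss in the sample marking "like".
    Unpopular items are keyed "like"; popular items are keyed "dislike".
    """
    ranked = sorted(like_counts, key=lambda item: like_counts[item])
    least_popular = ranked[:n_each]
    most_popular = ranked[-n_each:]
    key = {item: "like" for item in least_popular}
    key.update({item: "dislike" for item in most_popular})
    return key


def infrequency_score(responses, key):
    """Count the responses that agree with the reversed key."""
    return sum(1 for item, keyed in key.items() if responses.get(item) == keyed)


# Hypothetical frequency counts for a six-item inventory.
like_counts = {1: 90, 2: 5, 3: 50, 4: 80, 5: 10, 6: 45}
key = build_infrequency_key(like_counts, n_each=2)

typical = {1: "like", 2: "dislike", 3: "like", 4: "like", 5: "dislike", 6: "like"}
unusual = {1: "dislike", 2: "like", 3: "like", 4: "like", 5: "like", 6: "dislike"}
print(infrequency_score(typical, key), infrequency_score(unusual, key))
```

Because the scale was formed from each sample separately, a key of this kind would be built once from the male counts and once from the female counts.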

The Acquiescence Scale provides an esti- 
mate of the S’s acquiescence, his tendency to 
respond “Appeal” or “Like.” It is scored by 
counting the number of “Appeal” responses 
for Items 1 through 30. Although a more ac-
curate estimate of acquiescence can be ob-
tained by counting the “Appeal” responses
for all test items, Items 1-30 provide a useful
and economical estimate since Items 1-30
correlate .74 with Items 31-300 for males and 
.69 for females. 
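
The scoring rule for the Acquiescence Scale can be sketched as follows. The function name, the response labels, and the response record are hypothetical and serve only to show the count over Items 1-30 beside the count over all 300 items.

```python
def acquiescence_score(responses, first_item=1, last_item=30):
    """Count "appeal" responses among Items first_item through last_item."""
    return sum(
        1
        for item in range(first_item, last_item + 1)
        if responses.get(item) == "appeal"
    )


# Hypothetical S who marks every third item "appeal" and the rest "dislike".
responses = {
    item: ("appeal" if item % 3 == 0 else "dislike") for item in range(1, 301)
}

short_score = acquiescence_score(responses)         # Items 1-30 only
full_score = acquiescence_score(responses, 1, 300)  # all 300 items
print(short_score, full_score)
```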

The third and present form of the inventory 
contains 13 scales in all: 3 response-set scales 
and 10 personality scales. The response-set 


Table 1
Scale Matrix for Male and Female College Freshmen

[Matrix entries are not recoverable from this copy.]

Note. r of .30 is significant at .05 level. Intercorrelations for males are shown in lower left triangle of matrix; for females,
in upper right.

* Since the Physical Activity items for females are homogeneous with the Intellectuality items, they are included in the latter
scale; consequently, the Physical Activity scale is omitted in the female form of the inventory.







scales include the: 1. Question Scale, or the 
number of omitted items, 2. Infrequency 
Scale, and 3. Acquiescence Scale. The re- 
maining scales are: 1. Physical Activity, 2. In- 
tellectuality, 3. Responsibility, 4. Conformity, 
5. Verbal Activity, 6. Emotionality, 7. Con- 
trol, 8. Aggressiveness, 9. Masculinity-Femi- 
ninity, and 10. Status. 

To secure two equivalent parts or tests, 
each scale has been halved and the halves 
separated to form two tests. This procedure 
establishes a useful format for determining re- 
liability and presents an opportunity for de- 
termining the value of a short form. Table 1 
shows the scale interrelationships for college 
male and female samples of 100 each. For 
males, the intercorrelations range from —.60 
to .83, and average .36. For females, r’s 
range from —.44 to .64 and average .26. 


Age and Intelligence ²


The relation of HVPI scales to age and in- 
telligence has been tested, since it is desirable 
for interpretative purposes to have the in- 
ventory relatively independent of these vari- 
ables. Heterogeneous samples of high school 
and college students and of employed adults 
were combined to increase the age range in 
correlating age with HVPI scales. Males 
range in age from 15 to 77; females range 
from 17 to 60 years. In general, these rela- 
tionships are either insignificant or of a low 
order. The correlation of —.64 between Con- 
trol scale and age for females is the exception 
to these findings. 

The relation between intelligence and HVPI 
scales was estimated by correlating the Won- 
derlic Personnel Test, a revision of the Army 
Alpha, with the HVPI scales; samples of em- 
ployed hospital attendants were used. None 
of the resulting correlations are significant; 
however, the range of Wonderlic scores is 
somewhat attenuated so that significant rela- 
tionships may occur in more heterogeneous 
samples. 


2 Tables summarizing the relation of the inventory 
scales to age and intelligence, and scale reliabilities 
have been deposited with the American Documenta- 
tion Institute. Order Document No. 5689, remitting 
$1.25 for 35-mm microfilm or $1.25 for 6 by 8 in. 
photocopies. 


Reliability ²


The internal consistency of the third re- 
vision was estimated by correlating Part I 
against Part II of each scale, using samples 
of 100 male and female college freshmen. The 
scale divisions are assumed to be equivalent, 
since items were assigned to Part I or II ran- 
domly. For males, these coefficients cor- 
rected for length range from .72 to .95, with 
a median of .85. For females, they range 
from .68 to .90, with a median of .79. 
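
The correction for length can be sketched as follows, assuming the Spearman-Brown formula, which the text does not name. The half-scale correlation of .60 is an invented value chosen only to show the arithmetic.

```python
def spearman_brown(r_half, factor=2.0):
    """Estimate the reliability of a scale lengthened by `factor`.

    With factor=2 this is the usual split-half correction: 2r / (1 + r).
    """
    return factor * r_half / (1.0 + (factor - 1.0) * r_half)


# Hypothetical correlation between Part I and Part II of one scale.
r_part1_part2 = 0.60
corrected = spearman_brown(r_part1_part2)
print(round(corrected, 2))
```

The corrected coefficient always exceeds the half-scale correlation when that correlation lies between 0 and 1, which is why the tabled values refer to the full-length scales.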

An estimate of retest reliability was ob- 
tained from data supplied by Walsh (14), who 
tested a sample of 38 TB patients before and 
after lung surgery. The time interval be- 
tween testings averaged about four months. 
The retest correlations range from .70 to .87, 
with a median of .75. 


Validity 


The inventory differentiates significantly a 
number of defined criterion groups which 
represent a wide range of personality differ- 
ences. In general, the obtained differences 
support the construct validity of the inven- 
tory. These findings are summarized in the 
following sections. 


Table 2

Mean Scale Differences for Matched Controls
and Psychiatric Patients
(Age, Occupational Class, and Status)

            Controls           Patients
            (N = 100)          (N = 100)

Scale       Mean     SD        Mean     SD

Inf*        13.2     5.9       18.2     7.4
Ac          12.5     5.8       10.4     6.9
PA          15.0     7.9       10.9     8.7
Int         10.5     7.6        8.7     7.8
Re           9.5     6.2        7.6     7.3
Cf          10.4     6.7        7.5     [illegible]
VA          10.1     6.6        7.3     7.7
Em           8.1     7.9        8.2     8.2
Co*         11.6     7.2       13.3     5.8
Ag          13.3     8.4       10.0     8.9
Mf          12.3     4.4        9.9     4.0
St          10.1     4.5        9.9     5.0

* Variances significantly different beyond .05 level.








Table 3

Distributions of Profile Peaks for Controls
and Psychiatric Patients

            Controls      Patients
Scale       (N = 183)     (N = 186)

PA             38            23
Int             5             8
Re             19            17
Cf             41            27
VA              1             1
Em             12            20
Co             35            67
Ag             12             6
Mf             19             8
St              1             9

Note. For this comparison, the control and patient samples
have been increased by adding unmatched Ss to these groups
in order to increase the “expected” cell frequencies. These
additional Ss closely approximate the matched samples with
respect to age, status, and occupation.
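
The comparison of the two distributions in Table 3 can be sketched as a chi-square test of a 2 x 10 contingency table. This is an illustration under assumptions: the text does not describe how the test was computed (for example, whether sparse scales such as VA were pooled), so the statistic below need not equal the one originally obtained. The frequencies are those of Table 3; the critical value of 27.877 is the tabled chi-square for 9 degrees of freedom at the .001 level.

```python
controls = {"PA": 38, "Int": 5, "Re": 19, "Cf": 41, "VA": 1,
            "Em": 12, "Co": 35, "Ag": 12, "Mf": 19, "St": 1}
patients = {"PA": 23, "Int": 8, "Re": 17, "Cf": 27, "VA": 1,
            "Em": 20, "Co": 67, "Ag": 6, "Mf": 8, "St": 9}


def chi_square_2xk(group_a, group_b):
    """Pearson chi-square for two groups classified into the same categories."""
    total_a = sum(group_a.values())
    total_b = sum(group_b.values())
    grand = total_a + total_b
    chi2 = 0.0
    for scale in group_a:
        category_total = group_a[scale] + group_b[scale]
        for observed, group_total in ((group_a[scale], total_a),
                                      (group_b[scale], total_b)):
            expected = group_total * category_total / grand
            chi2 += (observed - expected) ** 2 / expected
    df = len(group_a) - 1
    return chi2, df


chi2, df = chi_square_2xk(controls, patients)
CRITICAL_001_DF9 = 27.877  # tabled chi-square, 9 df, .001 level
print(round(chi2, 2), df, chi2 > CRITICAL_001_DF9)
```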


Controls and Psychiatric Patients 


For this comparison, male samples of 100 
controls and 100 psychiatric patients were 
matched for age, socioeconomic status, and 
principal occupation. The group distributions 
for these matched variables are not signifi-
cantly different. The control group consisted
of employed and unemployed adults tested in 




the process of employee selection and similar
personnel studies at a VA psychiatric hospital. 
The patient group included 74 psychotic and 
26 nonpsychotic patients, all without organic 
involvement. Most of them were tested near 
the end of their hospitalization, although a 
few were tested shortly after admission. Table 
2 summarizes the results of the comparison. 

The significant differences in Table 2 are 
in the expected directions; that is, a literal 
interpretation of the scale meanings yields 
descriptions for normals and for psychiatric 
patients which are congruent with stereotypes 
of normality and psychiatric status. These 
differences reveal that normals give common 
responses (Inf), are more responsive (Ac), 
are physically active (PA), conforming (Cf), 
verbal (VA), aggressive (Ag), and masculine 
(Mf). In contrast, patients give infrequent 
or unusual responses (Inf), are less responsive 
(Ac), less active (PA), nonconforming (Cf), 
less verbal (VA), more passive (Ag), and 
more feminine (Mf). 
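
The size of such differences can be gauged from the summary values in Table 2. The sketch below treats the two groups as independent samples, which is an assumption; the text does not say which form of test was applied to the matched groups, so these values are illustrative rather than a reproduction of the original analysis. The means and SDs are those given for the Ac and Mf scales.

```python
from math import sqrt


def t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """t statistic for two independent groups, from means, SDs, and Ns."""
    standard_error = sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / standard_error


# Controls versus patients, N = 100 in each group (Table 2).
t_ac = t_from_summary(12.5, 5.8, 100, 10.4, 6.9, 100)  # Acquiescence
t_mf = t_from_summary(12.3, 4.4, 100, 9.9, 4.0, 100)   # Masculinity-Femininity
print(round(t_ac, 2), round(t_mf, 2))
```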

Similar differentiation is obtained between 
control and patient samples when distribu- 
tions of high point scales are formed. For 
this comparison, individual profiles were clas- 
sified by the highest scale in the profile. The 
resulting distributions for patients and con- 


Table 4

Mean Scale Scores for Controls, Psychiatric Patients, TB Patients, and Psychopaths

          Control      Psychopath    TB          Psychiatric
Scale     (N = 100)    (N = 50)      (N = 61)    (N = 100)      F

?         25.2         51.5          16.2        30.9           --
Inf       13.2         14.5          18.9        18.2           13.47***
Ac        12.5         10.3          11.3        10.4            2.02
PA        15.0         12.0          13.0        10.9            3.88**
Int       10.5          8.9           8.9         8.7             .96
Re         9.5          6.9           7.0         7.6            3.03*
Cf        10.4          6.9           9.3         7.5            4.16**
VA        10.1          7.7           8.1         7.3            2.62
Em         8.1          9.1           8.5         8.2             .18
Co        11.6         11.2          13.0        13.3            2.09
Ag        13.3          9.3          10.6        10.0            3.29*
Mf        12.3         10.4          10.9         9.9            5.57***
St        10.1         10.0           9.9         9.9             .04

Note. Controls and psychiatric patients were matched for age, status, and occupation. Psychopaths and TB patients
[illegible] two groups for these variables. Note insignificant differences on status scale.
* p < .05.
** p < .01.
*** p < .001.










Table 5 


Significant Mean Score Differences 


Scale Criterion Groups Sig Level 








Infrequency NP, TB > C, Pd <.001, <.01 
Physical Activity C > NP, Pd <.01, <.05 
Responsibility C > Pd, TB <.05, <.05 
Conformity C > Pd, NP <.01, <.01 
Mf C > Pd, NP, TB <.05, <.001, <.05 
Aggressiveness C > Pd, NP <.01, <.01 





Table 6

Distributions of Profile Peaks for University of Maryland Colleges

Entering Male Freshmen
(N = 382)

                      Arts and
                      Science      Engineering   Business    Education   Agriculture
Scales                (N = 101)    (N = 120)     (N = 96)    (N = 38)    (N = 27)

Physical Activity        15            33            3          11           1
Intellectuality          25            39            7           2
Responsibility           22             5           18          13
Conformity                9            13           41           6
Verbal Activity          10             8           15           2
Emotionality             20            22           12           4

Entering Female Freshmen
(N = 119)

                      Arts and     Home
                      Science      Economics     Business    Education   Nursing
Scales                (N = 59)     (N = 46)      (N = 17)    (N = 58)    (N = 19)

Intellectuality 14 5
Responsibility 10 20
Conformity 14 18
Verbal Activity 12 10
Emotionality 9 s 5 1

Note. For males, distributions of scale peaks for Arts and Sciences, Engineering, and Business are significantly different
at the .01 and .001 levels, respectively. For females, distributions for Arts and Sciences, Home Economics, and Education are
significantly different at the .10, .06, and .01 levels, respectively. For this comparison, only the first six inventory scales were
employed.


trols, shown in Table 3, are significant beyond
the .001 level.


Normals, Psychopaths, Tuberculosis, and Psychiatric Patients


The study reported above was extended by
comparing the matched control and psychi-
atric samples with two additional samples of
61 TB patients and 50 criminal psychopaths.³
These samples are assumed to have a crude
comparability with respect to age, socioeco-
nomic status, and intelligence. The mean ages
for the four groups range from 27.0 to 33.2
years; the four means on the status scale
range only from 9.9 to 10.1 raw score points;
there is a similar distribution of principal oc-
cupations in each of the four groups.


3 The prison psychopaths were selected by Lloyd
B. Calkins, psychologist for the Maryland peni-
tentiary. All Ss in this sample were diagnosed by a
test battery including, at a minimum, the MMPI,
W-B, and DAP.










The group means were tested first by a 
simple analysis of variance; then individual t
tests were performed for the significant analy-
sis of variance tests. The results of these
analyses are shown in Tables 4 and 5. 


College Choice 


An additional study attempted to differenti- 
ate the profiles of university freshmen on the 
basis of their college choice. The sample in- 
cluded groups of male freshmen majoring in 
business, arts and sciences, and engineering. 
For the analysis, 209 profiles were classified 
by high point scale among the first six scales
of the second revision and frequency distribu-
tions formed for each college. Chi-square tests re-
vealed that differences between any pair of
colleges are significant beyond the .05 level. 

The following year, this test was replicated 
with a sample of 317 freshmen, using the 
third revision of the inventory. Two of the 
differences among scale peak distributions are 
significant at the .001 level and a third at the 
.01 level. In addition, these differences are 


substantial. The engineers, for example, have 
60 per cent of their peaks among the Physical 
Activity and Intellectuality scales, whereas 
the business freshmen have only 10.4 per cent 


of their peaks among these scales. Likewise, 
the business freshmen have 77 per cent of 
their peaks among the Responsibility, Con- 
formity, and Verbal Activity scales, whereas 
the engineers have 22 per cent of their peaks 
among these scales. Table 6 summarizes this 
evidence along with several small samples not 
amenable to statistical test. 
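
The percentages cited above follow directly from the male frequencies in Table 6, as the short check below shows. The scale abbreviations and the helper name `share` are ours; the counts are those tabled for the Engineering and Business columns.

```python
# Profile-peak frequencies for two colleges, entering male freshmen (Table 6).
peaks = {
    "Engineering": {"PA": 33, "Int": 39, "Re": 5, "Cf": 13, "VA": 8, "Em": 22},
    "Business": {"PA": 3, "Int": 7, "Re": 18, "Cf": 41, "VA": 15, "Em": 12},
}


def share(college, scales):
    """Per cent of a college's profile peaks falling on the named scales."""
    counts = peaks[college]
    return 100.0 * sum(counts[s] for s in scales) / sum(counts.values())


for college in peaks:
    print(
        college,
        round(share(college, ["PA", "Int"]), 1),
        round(share(college, ["Re", "Cf", "VA"]), 1),
    )
```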


Received January 15, 1958. 


References 


1. Berdie, R. F. Likes, dislikes, and vocational
interests. J. appl. Psychol., 1943, 27, 180-189.

2. Darley, J. G. A preliminary study of relations
between attitude, adjustment, and vocational
interest tests. J. educ. Psychol., 1938, 29, 467-473.

3. Forer, B. R. Personality dynamics and occupational
choice. Amer. Psychologist, 1951, 6, 378-379.
(Abstract)

4. Garman, G. D. The Strong Vocational Interest
Blank as a measure of manifest anxiety.
Amer. Psychologist, 1954, 9, 372-373. (Abstract)

5. Gough, H. G., McKee, M. G., & Yandell, R. J.
Adjective check list analyses of a number of
selected psychometric and assessment variables.
Berkeley: Inst. Pers. Assessment and Research,
Univer. Calif., 1955. (Mimeo.)

6. Hall, C. W. The prestige values assigned to a
group of 249 occupations by 200 independent
raters. Western Reserve Univer. (Mimeo.)

7. Laurent, H. A study of the development backgrounds
of men to determine by means of the
biographical information blank the relationship
between factors in their early backgrounds
and their choice of professions. Unpublished
doctoral dissertation, 1951, Western
Reserve Univer.

8. Lawshe, C. H., Jr. A nomograph for estimating
the validity of test items. J. appl. Psychol.,
1942, 26, 846-849.

9. National Opinion Research Center. Jobs and
occupations: A popular evaluation. Opinion
News, 1947, 9, 3-13.

10. Randall, H. Occupational Prestige. Occupational
Planning Committee, Cleveland Welfare
Federation, 1951. (Mimeo.)

11. Sternberg, C. Personality trait patterns of college
students majoring in different fields.
Psychol. Monogr., 1955, 69, No. 18 (Whole
No. 403).

12. Strong, E. K., Jr. Vocational interests of men
and women. Stanford: Stanford Univer. Press,
1943.

13. Terman, L. M. Scientists and nonscientists in a
group of 800 gifted men. Psychol. Monogr.,
1954, 68 (Whole No. 7).

14. Walsh, R. P. Vocational interests, their stability
and personality correlates in hospitalized
tuberculosis patients. Unpublished master’s
thesis, Univer. Maryland, 1956.

15. Weir, J. R. An attempt to identify vocational
interest profiles within a neuropsychiatric
population. Unpublished doctoral dissertation,
Univer. California, 1951.








Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Length of Work Periods in Visual Research ¹


Miles A. Tinker 


University of Minnesota 


The duration of the task employed in in- 
vestigating visual efficiency has varied consid- 
erably from one reported experiment to an- 
other. In general the tendency has been to 
use relatively short work periods, i.e., one to 
five minutes. The reliability and validity of 
measurement as related to length of work 
period is an important issue. In an earlier 
study by Tinker (3), it seemed that a pro- 
longed reading task, such as 10 minutes, was 
necessary to achieve valid results in certain 
critical comparisons where differences tend to 
be small. One of the difficulties in this ear- 
lier study (3) was that results for brief work 
periods in a 1928 investigation were com- 
pared with prolonged work periods in the 
1955 report. It now seems obvious that such 
comparisons should be made for work pe- 
riods by the same group of subjects. 

In the 1955 study, Tinker (3) did show that the results were the same in comparing italic with roman lower case printing in each of three successive 10-minute work periods and in a 30-minute work period. Similar trends were found in comparing lower case with all-capital printing in each of four successive four-minute work periods and in a 16-minute work period. The italic was read significantly slower than the roman, and the all-capital material was read significantly slower than lower case printing. Reliability of measurements was fairly high: a mean of .84 for one and .87 for the other study. These results, therefore, indicate that 10 minutes is as satisfactory as 30 minutes in the former comparison and that four minutes is as satisfactory as 16 minutes in the latter comparison. It is now desirable to explore results obtained for short work periods, i.e., from 1½ to 10 minutes.

The present experiment was designed to investigate the use of relatively short work periods in one type of visual research. It appears that the length of work period that needs exploring ranges from 1½ to 10 minutes. In the study (3) cited above it is clear that 10 minutes was as satisfactory as longer periods in one comparison (italics vs. roman), and, similarly, 4 minutes was satisfactory in the other (all-capitals vs. lower case). The present experiment is concerned with speed of perception in reading under various levels of illumination.

1 The writer is grateful to the University of Minnesota Graduate School for a research grant to finance this study.


Materials and Procedure 


Tinker’s Speed of Reading Test (2) was employed.
Since comprehension is constant, the test measures 
virtually speed of perception in reading as a single 
variable. The two forms of the test are approxi- 
mately equivalent. Both were set in regular (ro- 
man) 10-point Excelsior type face with 2-point lead- 
ing in a 20-pica line width on eggshell paper stock. 
The 180 university sophomore Ss were tested indi- 
vidually in a light laboratory which provided well- 
dispersed indirect illumination. In Group A, the 
Control Group, Forms I and II of the test were 
given under 25 foot-candles of light. In Group B 
Form I was given under 5 foot-candles and Form II 
under 25 foot-candles of light. And in Group C Form 
I was administered under 25 and Form II under 200 
foot-candles of light. The time limit was 10 minutes 
for each form, with marks on the test to indicate the 
end of 1½ and 5 minutes of work. Successive S’s for
Groups A, B, and C were systematically rotated, i.e., 
S No. 1 was put in Group A, No. 2 in Group B, 
No. 3 in Group C, No. 4 in Group A, etc. 
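
The systematic rotation described above can be sketched in a few lines. This is only an illustration: the function name `assign_group` and the constant `GROUPS` are hypothetical, and the sketch assumes that subjects were numbered consecutively from 1 in order of appearance.

```python
from collections import Counter

# Hypothetical labels for the three test groups named in the text.
GROUPS = ("A", "B", "C")


def assign_group(subject_number, groups=GROUPS):
    """Return the test group for a 1-based subject number under systematic rotation."""
    if subject_number < 1:
        raise ValueError("subject numbers start at 1")
    # S No. 1 -> A, No. 2 -> B, No. 3 -> C, No. 4 -> A, and so on.
    return groups[(subject_number - 1) % len(groups)]


# Assign all 180 subjects and count how many fall in each group.
assignments = {s: assign_group(s) for s in range(1, 181)}
group_sizes = Counter(assignments.values())
print(group_sizes)
```

With 180 subjects the rotation places 60 in each group, which agrees with the N reported in Table 1.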


Results and Discussion 


The results are given in Table 1. Inspection of the table reveals the following trends: (a) The reliability of measurement is high, varying from .82 to .97 with a median coefficient of .95. (b) In Group B the results show that similar trends occurred with 1½, 5, and 10 minutes of reading, i.e., there was no significant difference in performance under 5 vs. 25 foot-candles of light. (c) In Group C there was no significant difference in performance under 25 vs. 200 foot-candles of light with a work period of 1½ minutes. With the 5-
minute and 10-minute work periods, the reading was retarded significantly under the 200 foot-candles of light, at the 2 and 1 per cent levels, respectively.


Table 1

Effect of Illumination on Speed of Reading With Varying Lengths of Reading Time
(N = 60 in each test group, 180 in all)

Test    Time             Test Form and                       Diff. Between Means in
Group   Limit            Foot-candles     Mean      SD       Paragraphs*   Per Cent     r       t
(1)     (2)              (3)              (4)       (5)      (6)           (7)          (8)     (9)

A       First 1½ min.    I, 25            13.62      4.05     0.0           0.0
                         II, 25           14.12      4.02
A       First 5 min.     I, 25            46.77     11.28     0.0           0.0
                         II, 25           46.62     12.09
A       First 10 min.    I, 25            94.97     22.97     0.0           0.0
                         II, 25           95.42     26.36
B       First 1½ min.    I, 5             12.87      3.21                  +1.9
                         II, 25           13.62      3.19
B       First 5 min.     I, 5             44.27      9.58                  +0.4         .92
                         II, 25           44.30      9.99
B       First 10 min.    I, 5             89.80     20.58                  -0.1         .95
                         II, 25           90.20     21.58
C       First 1½ min.    I, 25            14.40      3.74                  -0.7         .86
                         II, 200          14.80      4.34
C       First 5 min.     I, 25            48.92     10.57    -1.40         -2.9         .95     2.55
                         II, 200          47.37     12.49
C       First 10 min.    I, 25            99.63     22.71    -3.07         -3.1         .97     3.55
                         II, 200          97.02     25.60

* The differences in Column 6 are "corrected" by the amount of the difference between the mean scores of Form I and Form II in Test Group A, which serves as a control group. The "corrections" amount to -0.50 for the first 1½ minutes, +0.15 for the first 5 minutes, and -0.45 for the 10 minutes. Original computations were carried to 4 decimal places.
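
The correction described in the footnote to Table 1 can be checked with a little arithmetic. The sketch below is illustrative only. The names `MEANS` and `corrected_difference` are hypothetical, the shortest work period is read as 1½ minutes, and the per cent difference is assumed to be taken relative to the Form I mean; the paper does not state the base, but this choice reproduces the legible per cent entries.

```python
# Form I and Form II means from Table 1, keyed by (test group, minutes of work).
# The key 1.5 reflects the reading of the shortest period as 1 1/2 minutes.
MEANS = {
    ("A", 1.5): (13.62, 14.12),
    ("A", 5): (46.77, 46.62),
    ("A", 10): (94.97, 95.42),
    ("B", 1.5): (12.87, 13.62),
    ("B", 5): (44.27, 44.30),
    ("B", 10): (89.80, 90.20),
    ("C", 1.5): (14.40, 14.80),
    ("C", 5): (48.92, 47.37),
    ("C", 10): (99.63, 97.02),
}


def corrected_difference(group, minutes, control="A"):
    """Return (difference in paragraphs, per cent difference) for one comparison.

    The raw Form II minus Form I difference is reduced by the same difference
    in the control group, which removes any inequality between the two forms.
    """
    form1, form2 = MEANS[(group, minutes)]
    ctrl1, ctrl2 = MEANS[(control, minutes)]
    diff = (form2 - form1) - (ctrl2 - ctrl1)
    per_cent = 100.0 * diff / form1  # assumed base: the Form I mean
    return diff, per_cent


for group in ("B", "C"):
    for minutes in (1.5, 5, 10):
        diff, per_cent = corrected_difference(group, minutes)
        print(group, minutes, round(diff, 2), round(per_cent, 1))
```

Working from the two-decimal means gives -3.06 paragraphs for Group C at 10 minutes rather than the tabled -3.07, which is consistent with the original computations having been carried to 4 decimal places.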

The results presented here suggest that 1½
minutes of reading is just as satisfactory as 
the longer periods of 5 and 10 minutes for 
5 vs. 25 foot-candles illumination. It might 
be noted that Paterson and Tinker (1), page 
178, found similar trends in typography 
studies, i.e., 1¾, 5¾, and 10 minutes working
time all yielded comparable results. For 25 
vs. 200 foot-candles the trend is different as
noted above. 

The fact that speed of visual perception in reading was lower under 200 than under 25 foot-candles for work periods of 5 and 10 minutes needs some interpretation. It is suggested here that glare was possibly operating to reduce efficiency under the 200 foot-candles of light. It is possible that the 1½-minute work period was too short for the glare to decrease efficiency but that the glare did become effective under the longer work periods of 5 and 10 minutes. Note in Table 1 that the decrease in efficiency was greater for the 10 than for the 5 minutes. Similar trends have appeared for high light intensities in another study not yet published.


Summary and Conclusions 


1. The purpose of this experiment is to provide additional information on the effect of length of work periods in visual research.

2. Work periods of 1½, 5, and 10 minutes were used while investigating the relative efficiency of speed of perception in reading under 5, 25, and 200 foot-candles of light.

3. There were 180 university sophomore Ss who were tested individually under well-controlled indirect illumination.

4. The three work periods yielded the same results for the 5 vs. 25 foot-candles comparison, i.e., there was no significant difference in performance with change in illumination. With the 25 vs. 200 foot-candles comparison, there was no significant difference for the 1½-minute work period but speed of reading was significantly slower under the 200 foot-candles for both the 5- and 10-minute work periods. It is suggested that glare was having a deleterious effect under the high illumination for the longer work periods.

5. One may tentatively conclude from the above results that work periods as short as 1½ minutes may be safely used in studying the relation between illumination and visual efficiency.


Received January 17, 1958. 


References 


1. Paterson, D. G., & Tinker, M. A. How to make 
type readable. New York: Harper, 1940. 

2. Tinker, M. A. Tinker speed of reading test. Min- 
neapolis: University of Minnesota Press, 1955. 

3. Tinker, M. A. Prolonged reading tasks in visual 
research. J. appl. Psychol., 1955, 39, 444-446. 





Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


Communication Restraints, Group Flexibility, and Group
Confidence ¹ ²


Robert C. Ziller 


Fels Group Dynamics Center, University of Delaware 


Interpretations of the results of experiments 
concerned with group problem-solving produc- 
tivity are necessarily complicated by qualifica- 
tions involving unique task demands (7). In 
an attempt to avoid this limitation, the study 
reported here concerns the correlates of a 
postulated fundamental dimension of group 
problem-solving behavior, group flexibility 
(5). More explicitly, the present study ex- 
plores the relationship between selected group 
characteristics and group flexibility opera- 
tionally defined as the ability of a group to 
reorganize to meet the time demands of a new 
situation. In addition, an investigation of 
group confidence as an affective concomitant 
of group performance was undertaken to fur- 
ther the understanding of group problem- 
solving behavior. 


Problem 


In general, it was proposed that the more flexible groups are those with more completely open communication systems, that is, groups with fewer communication restraints.

The following characteristics of the leaders and groups were selected as independent variables: the F-scale score of the leader, the conformity of the leader, and the attraction of the group.³

Leaders with high F-scale scores were expected to structure the communication network of the group around themselves as the focal person, thereby restricting information exchange and group flexibility.

1 This paper is a condensation of a thesis submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy, University of Michigan, 1955. Special acknowledgment must be accorded W. Clark Trow, Alvin Zander, Leonard Berkowitz, and E. Paul Torrance for their insightful critical reading of the original manuscript.

2 This research was carried out under the Air Force Personnel and Training Research Center, Lackland Air Force Base, San Antonio, Texas. The opinions or conclusions contained in this report are those of the author. They are not to be construed as reflecting the views or indorsement of the Department of the Air Force.

3 The selection of the variables was also influenced, in part, by time considerations, since the Ss were available for a maximum of one hour.

An attractive group is described simply as 
one in which the group members feel secure, 
feel as if they “belong,” know what to expect 
of others and can communicate with them 
without restraint (3). There is evidence (6) 
that members of highly attractive groups tend 
to deviate from the group norms when the 
norm is discordant with reality. In such 
groups, the members are more disposed to 
critically evaluate established procedures in 
the light of new circumstances and evolve an 
organization more in accord with the demands 
of the given situation. 

Finally, it was anticipated that the extent to which the leader is willing to modify his judgment toward the group norm (conformity) is related to group flexibility through the intervening variable of the relative openness of the group’s communication network. It was proposed, provisionally, that groups whose leaders are moderately willing to modify their judgment toward the group norm are best prepared to adapt to the requirements of a new situation.


Experimental Procedure


Subjects


Altogether, 96 B-29 or B-50 aircrews com- 
prising about 1,000 men were involved in the 
experiment. For the most part each crew 
was composed of 11 men of whom five were 
officers (including the assigned aircraft com- 
mander or leader) and six were airmen. How- 
ever, some of the crews were as small as eight 
men and others as large as 15. All subjects 
had been members of the same crew for a 
minimum of three months. Some members 
had been together for as much as a year. 


Procedure 


Immediately preceding the problem-solving sessions, the members of a group completed sociometric and group attraction questionnaires. Then they seated themselves around a rectangular area and the general nature of the group task was described. The directions, in part, were:

“The object of the test is to make as high a crew 
score as possible; that is, the highest score that your 
crew can produce using any means at your disposal.
The way you go about this is entirely up to you. 
You may hand in one answer sheet for the crew, an 
answer sheet for each crew member, or any other 
arrangements you wish to make. When two or more 
answer sheets contain answers to the same question, 
they will be averaged to provide a crew score for 
that question.” 

Any questions arising from the group were an- 
swered by rereading the directions. At this point a 
questionnaire purported to be a test of the indi- 
vidual’s ability to estimate accurately the group’s 
performance was administered. Actually, the ques- 
tionnaire provided the data for the measurement of 
group confidence. Then the members were allowed 
one minute in which to examine a sample exercise, 
and then sixteen minutes in which to complete all 
eight exercises comprising the group task. During 
their performance, an observer recorded the group’s 
method of approach to the task. Immediately following the problem-solving session, the crew members completed the F scale and the “conformity” scale.⁴


The Group Performance Test 


In the experiment, each crew member received a copy of Test 401-B, one of the tests in the Intellectual Talents Battery being developed for use in the Air Force by the Personnel Research Laboratory, Air Force Personnel and Training Research Center, Lackland Air Force Base, Texas. The test booklet consisted of eight rather long exercises. In each exercise, a problem was raised and 15 facts presented that had a bearing on the solution to the problem. The Ss were instructed to choose five of the 15 facts most important in reaching a decision on the problem. The respondent selected the most important facts in deciding whether a certain enemy-held island base was being developed as an air base or a submarine base, what type of home a married officer should buy, what man a squadron commanding officer should recommend for Officer Candidate School, what city should be selected for a recruiting office, what site a manufacturer should choose for a factory, which of two devices to train radar observers should be purchased by the Air Force, which secretary should be hired, and which of two types of visual aids should be used by Air Force schools.

4 This was originally a four-item scale used by Alvin Zander, Research Center for Group Dynamics, University of Michigan, and was used in the present study at his suggestion.

Because of the complexity of the decision- 
making situation, it was impossible for an in- 
dividual or a discussion group to complete all 
eight problems efficiently within 16 minutes. 
The task was completed most expeditiously by splitting the group into two or more subgroups and apportioning the problems among them.⁵

Unpublished reports describing the development of the instrument indicate that measures of its internal consistency by the split-half technique were low (r = .37).


Measuring the Variables


F Scale


Two items were eliminated from the original 
F Scale when it was observed that Air Force 
personnel were openly derisive regarding the 
one, while the other appeared outdated. Re- 
ported reliability measures of the original in- 
strument range from .81 to .91 with an aver- 
age of .87 (1, p. 251). Scores were tabulated 
and divided into high and low categories by 
dividing the distribution at the median. 


Conformity of the Leader 


The measure of conformity included the fol- 
lowing items: 


Suppose you found yourself holding an opinion which was not in agreement with an opinion held by the other people in your crew and this opinion was on an issue of importance both to you and to the crew. In each set of statements below, check the one alternative which would best express how you would feel and what you would do if such a situation existed in your crew.

1. I would feel: (check one) a. Not at all concerned about my differing from the crew opinion, b. Slightly concerned about my differing from the crew opinion, c. Moderately concerned about my differing from the crew opinion, d. Greatly concerned about my differing from the crew opinion.

2. If I found that my opinion differed from that of this crew, I would feel: (check one) a. Just as sure of my own opinion, b. Less sure of my own opinion, c. Much less sure of my own opinion, d. Very much less sure of my own opinion.

3. If I found that my opinion differed from that of this crew, I would have: (check one) a. A desire to maintain my own opinion, b. Hardly any desire to change my own opinion to agree with that of the crew, c. A moderate desire to change my opinion to agree with that of the crew, d. A strong desire to change my opinion to agree with that of the crew, e. A very strong desire to change my opinion to agree with that of the crew.

5 The methods used by the crews in the present experiment were anticipated through observations of aircrew performance on similar tasks (13).

In calculating the total score, the alterna- 
tives of each question were assigned a weight 
of one to four or one to five, in accordance 
with the number of options. The distribution 
of scores was divided into high, medium, and 
low categories in accordance with the con- 
siderations described initially. The low range 
included scores 10, 11, 12, and 13. The cut- 
off scores were dictated by the necessity of a 
workable number of cases in the middle range. 


Group Attraction 


The index of group attraction followed di- 
rectly from Festinger’s definition of group 
cohesiveness; that is, the resultant of all 
forces acting on the members to remain in the 
group (2, p. 164). The term “group attrac- 
tion” was used rather than the more general 
term “cohesiveness” since the two questions 
involved in this index did not measure the 
forces acting to prevent the individual from 
leaving the group. 

The index was derived from the following 
questions: 

1. If one man from each crew were ordered to re- 
main at the home base, rather than to attend Survival 
School and you were the man chosen, how would 
you feel about this? (check one) a. I'd feel very 
lucky, b. I wouldn’t mind, c. It wouldn’t matter one 
way or another, d. I wouldn’t like it, but would not 
feel too strongly about it, e. I wouldn’t like it a bit. 

2. How would you feel about being transferred to 
another crew or about your crew being split up as 
replacements for other crews? 

The alternatives were the same in each of 
the items. 

In calculating the index of group attraction, the alternatives of each item were weighted from one to five, respectively. The score of each individual group member was obtained by a simple summation of the weighted responses. The mean score of the group provided the desired index. An index of the reliability of the measure was calculated using the split-half technique in which each crew was divided into two subgroups with approximately the same number of officers and airmen. A Pearsonian correlation of .46 was obtained by correlating the average scores of the 96 pairs of subgroups.


Table 1

Analysis of Variance of the Group Approach
to the Problem and Group Score

Group                  Average
Approach       N       Score

a             37        8.15
b             20       10.82
c             13       12.39
d              8       14.13
e             18       14.98

Note: F = 18.03; df = 4 and 92; p < .01.

Group Flexibility 

An operational definition of group flexibility was derived from the five methods of approach to the task: (a) group members worked individually with little or no interaction; (b) group worked as a unit discussing each problem in turn; (c) group began working as a unit (as in Method b) but, recognizing that there was insufficient time to complete the task by their initial method, changed to Method e; (d) group worked as a highly organized unit, limited the time spent on each problem to two minutes and appointed a time keeper and recorder; (e) group subdivided into two, three, four, or five sections and divided the problems accordingly.

The frequencies with which each method 
was employed are given in Table 1. 

While the study is concerned primarily with process rather than product variables and the low reliability of the instrument notwithstanding, it was observed that the method of approach placed definite limits on the score attained. This observation was easily tested through a simple analysis of variance between method and group score (see Table 1). It was also apparent that the mean score increased in direct relation to the method employed. The highest scores were attained by groups which used the subgroup or highly-organized group approach.

6 Correct answers to the items were provided by the test writers. In cases where one answer sheet was submitted by the group, the group scores were obtained from these single sheets by applying a simple correction formula (R-—4W). When members of the same groups submitted answer sheets which overlapped, the mean of the scores became the group score.

Groups using the approaches which reflected an awareness of the need to reorganize to meet the time demands of the problem, Methods c, d, and e, were compared with groups using approaches a and b. Reorganization is used here in the sense that the aircrew was already organized as a highly integrated team with a single leader and was accustomed to performing as a single unit. Flexible crews were those who were able to overcome this organizational set and reorganize in accordance with the time demands of the new situation. Actually, however, two inappropriate sets seemed to have been involved: the usual group-test-situation set in which comparisons of answers are not permitted (Method a) and the set that crews must perform as a single free-discussion unit (Method b).
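
The flexibility criterion described above reduces to a two-way classification of the five approaches. The sketch below is a minimal illustration with hypothetical names (`FLEXIBLE_METHODS`, `flexible_share`); the example frequencies are the Low F-scale row of Table 3.

```python
# Approaches treated as reflecting reorganization to meet the time demands.
FLEXIBLE_METHODS = frozenset("cde")


def is_flexible(method):
    """Return True when an approach (a-e) counts as flexible under the criterion."""
    if method not in "abcde" or len(method) != 1:
        raise ValueError("method must be one of a, b, c, d, e")
    return method in FLEXIBLE_METHODS


def flexible_share(counts):
    """Return the proportion of groups that used a flexible approach."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no groups to classify")
    flexible = sum(n for method, n in counts.items() if is_flexible(method))
    return flexible / total


# Frequencies for crews with low-F leaders, as printed in Table 3.
low_f = {"a": 15, "b": 8, "c": 9, "d": 4, "e": 10}
low_f_share = flexible_share(low_f)
print(low_f_share)
```

For the Low F-scale row, 23 of 46 crews fall in the flexible category, which matches the 50 per cent figure reported in the Results.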


Group Confidence 


The individual group members indicated 
their degree of confidence in the group by re- 
sponding to the following question: 


How well do you think your crew will do in this problem-solving situation?

____ superior: better than 90% of the other crews.

____ above average: better than 70% of the other crews.

____ average: about as well as 50% of the other crews.

____ below average: only as well as 30% of the other crews.

____ low: only as well as 10% of the other crews.


The alternatives of the question were 
weighted from 1 to 5, with the lower score in- 
dicating the greater confidence. The split- 
half estimate of reliability was .55 (p = .001). 

The relationships among the three independent variables were calculated (Table 2). All correlations were low. Along with the moderate reliability estimates of the measures, the evidence permits the conclusion that the experiment involves three relatively unrelated predictors. It was interesting to note, however, that groups led by the more conforming leaders tended to be more attractive.


Table 2

Intercorrelations Among the Three
Independent Variables

I    Rigidity of the leader
II   Conformity of the leader
III  Group Attraction

* Significant at the .01 level.


Results 


The data did not permit a pattern analysis involving the three independent variables simultaneously, as this would have generated cells without cases. Therefore, chi-square tests were calculated from the 2 × 2 tables formed by tabulating the frequency with which groups, dichotomized according to the independent variables, used the flexible or nonadaptive approaches to the task.
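
A 2 × 2 chi-square of this kind can be sketched as follows, using the group attraction frequencies of Table 5. Several points are assumptions rather than statements from the paper: the statistic is taken to be the ordinary Pearson chi-square without a continuity correction, the function names are hypothetical, and the High-attraction frequency for approach d, which is not legible in the table, is taken as 1 because that is what the Table 1 total for approach d implies.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one case")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


def pool(counts):
    """Collapse approach frequencies into (a and b, c through e)."""
    return counts["a"] + counts["b"], counts["c"] + counts["d"] + counts["e"]


# Table 5 frequencies. The value d = 1 in the High row is an assumption.
high_attraction = {"a": 18, "b": 9, "c": 11, "d": 1, "e": 9}
low_attraction = {"a": 19, "b": 10, "c": 2, "d": 7, "e": 9}

chi_square = chi_square_2x2(*pool(high_attraction), *pool(low_attraction))
print(round(chi_square, 2))
```

Under these assumptions the statistic comes to about .29, the value reported for Table 5.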

The analysis of the data with regard to the relationship between the leader’s F-scale score and the criterion of group flexibility is presented in Table 3. The differences with regard to the flexibility criterion were in the expected direction but statistically significant only at the 10 per cent level of confidence. Nevertheless, it was noted that 50 per cent of the low groups used the approaches which reflect flexibility as compared with 31 per cent of the high groups.

Table 3

The Relationship Between the Leaders’ F-Scale Scores
and Group Flexibility

                   Group Approach
F Scale        a     b     c     d     e

High                                   8
Low           15     8     9     4    10

Chi square (a and b vs. c, d, and e) = 3.43; p = .10

Note: The total number of groups reported using each approach is not always consistent with the number in Table 1 due to incomplete data with regard to two crews.


With regard to the association between leader conformity and group flexibility (Table 4), it was seen that a greater percentage of groups in which the leader’s C-scale scores indicate moderate conformity used approaches reflecting greater flexibility. The approaches representing a high degree of flexibility were used by 52 per cent of the moderate groups but only by 27 per cent of the high-low groups. Thus, it was found that groups in which the leader was moderately concerned about the group’s opinions were more flexible in this problem-solving situation than groups in which the leader was extremely concerned or unconcerned.


Table 4

The Relationship Between the Leaders’ Conformity
Indices and Group Flexibility

                   Group Approach
Conformity     a     b     c     d     e

High
Medium
Low

Chi square (a and b vs. c, d, and e) = 6.33; p = .05

Note: The total number of groups reported using Approach d is not consistent with the number reported in Table 1 due to incomplete data. High and low conformity frequencies were pooled throughout.

The results with regard to the relationship 
between group attraction and group flexibility 
were not statistically significant and no trend 
was in evidence (see Table 5). 

Table 5

The Relationship Between Group Attraction
and Group Flexibility

Group              Group Approach
Attraction     a     b     c     d     e

High          18     9    11     1     9
Low           19    10     2     7     9

Chi square (a and b vs. c, d, and e) = .29; p = .70


It was apparent, however, that the measure of group flexibility employed was not sufficiently sensitive; far too many groups were required for analysis. It is entirely possible that groups in a setting other than the military would avoid the approach to the task in which the group acted merely as an uncommunicative collection of individuals. In this event, the time required to shift to the subgroup approach to the task might serve as a superior measure of group flexibility. Of course, it still remains to be shown to what extent flexibility, as defined in the present study, is generalizable.


Table 6

Analysis of Variance of Group Confidence According
to Leaders’ F-Scale Scores and Group
Attraction Indices

Source of                      Variance
Variance              df       Estimate        F        p

Leader F Scale         1         16.62        1.88
Group Attraction       1        149.56       16.90
Interaction            1         35.88        4.05     .05
Groups                90          8.85

Note: When leaders’ F-scale scores were categorized as high and low, and Group Attraction scores were similarly divided, the mean Group-Confidence indices for groups within the categories were (lower scores indicate greater confidence in the group):

Leader                  Group Attraction
F-Scale Score       High          Low          Total

High                1.80          2.19         2.00
                    (N = 23)      (N = 25)
Low                 1.97          2.09         2.03
                    (N = 25)      (N = 21)
Total               1.89          2.14


Table 7

Analysis of Variance of Group Confidence According
to Leaders’ Conformity Scores and
Group Attraction Indices

Source of                      Variance
Variance              df       Estimate        F

Leader Conformity      2         23.88        2.66
Group Attraction       1         91.60       10.20
Interaction            2          2.91
Groups                88          8.98

Note: When Leaders’ Conformity scores were categorized as high, medium, and low, and Group Attraction indices were dichotomized, the mean Group Confidence indices for groups within the categories were (lower scores indicate greater confidence in the group):

Group                      Leader Conformity
Attraction       High          Medium        Low           Total

High             1.77          1.93          1.93          1.89
                 (N = 12)      (N = 30)      (N = 6)
Low              2.00          2.10          2.23          2.14
                 (N = 6)       (N = 20)      (N = 20)
Total            1.85          2.00          2.16          2.01


Table 8

Analysis of Variance of Group Confidence According
to Leaders’ F-Scale and Conformity
Scale Scores

Source of                      Variance
Variance              df       Estimate

Leader Conformity      2         50.52
Leader F Scale         1          1.64
Interaction            2          4.90
Groups                88          9.37

Note: When leaders’ conformity scores were categorized as high, medium, and low, and leaders’ F-scale scores were dichotomized, the mean Group Confidence indices for groups within the categories were (lower scores indicate greater confidence in the group):

Leader                     Leader Conformity
F-Scale Score    High          Medium        Low           Total

High             1.65          1.98          2.21          2.00
                 (N = 8)       (N = 24)      (N = 16)
Low              2.00          2.01          2.09          2.03
                 (N = 10)      (N = 26)      (N = 10)
Total            1.84          2.00          2.16          2.01


In analyzing the data with regard to group 
confidence, factorial designs for cells of un- 
equal size were employed (11, p. 285). (In 
Tables 6, 7, and 8 the entries in each cell are 
the mean group confidence scores for the 
groups falling in the category.) The smaller 
means indicate greater confidence. 
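
The F ratios in these analyses follow from dividing each variance estimate by the estimate for groups within cells. The sketch below recomputes the Table 6 ratios; it is an illustration with hypothetical names, and it assumes that the "Groups" line serves as the error term for every effect, which the paper does not state but which reproduces the printed F values.

```python
# Variance estimates printed in Table 6.
TABLE_6_EFFECTS = {
    "Leader F Scale": 16.62,
    "Group Attraction": 149.56,
    "Interaction": 35.88,
}
WITHIN_GROUPS = 8.85  # the "Groups" line, assumed to be the error term


def f_ratio(effect_variance, error_variance):
    """Return the ratio of an effect's variance estimate to the error estimate."""
    if error_variance <= 0:
        raise ValueError("the error variance estimate must be positive")
    return effect_variance / error_variance


f_ratios = {
    source: round(f_ratio(variance, WITHIN_GROUPS), 2)
    for source, variance in TABLE_6_EFFECTS.items()
}
print(f_ratios)
```

The same division applied to Table 7 (23.88 / 8.98 and 91.60 / 8.98) gives the F values of 2.66 and 10.20 printed there.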

The results with regard to group attraction and leader conformity were statistically significant and indicate that members of the more attractive groups whose leaders claim that they tend to conform to the opinion of the majority express greater confidence in their group’s ability to succeed in a new situation.

The main effects of leader F-scale scores 
were not significant. However, when leader 
F-scale scores and group attraction indices 
were studied conjointly, interaction effects 
were significant (see Table 6). The greatest 
confidence was expressed by attractive groups, 
the leaders of which had high F-scale scores. 

While group productivity was not central to the study, nevertheless the relation between the independent variables and the group scores was explored by means of analysis of variance designs similar to those in Tables 6, 7, and 8. As might have been anticipated on the basis of the low reliability index of the performance test, the results were not statistically significant nor were there any apparent trends.

Discussion 

In terms of the communication framework sketched at the outset, both the more flexible groups and the more confident groups were generally found to be those with the more open communication systems, that is, communication systems with relatively fewer restraints on the members’ interaction. Thus, the results with reference to group flexibility suggest that groups with moderately conforming and low-F leaders adapted more readily to the requirements of the new situation. The more confident groups were those with highly conforming leaders and those which were highly attractive to their members.

The apparent inconsistency in the findings 
regarding the conformity of the F leader was 
anticipated and interpreted as a qualification 
rather than as a directly contradictory re- 
sult. That is, it was anticipated that while 
overly nonconforming leaders would tend to 
restrict communication among the members, 
the overly conforming leader, on the other 
hand, would tend to lack the confidence neces- 
sary to initiate group action. Thus, the char- 
acteristics of the leader which the group mem- 
bers find desirable—high conformity—may 
not be desirable with reference to other 
criteria—group flexibility. 

The relationship between the independent 
variables presumed to be related to open com- 
munication systems and subsequently found 
to be related to group confidence corroborates 
the results of earlier experiments involving 
communication networks. In these earlier 
studies, it was revealed that, in general, mem- 
bers of groups with more open communica- 
tion networks tend to have high group morale 
(9, 10). However, it should be emphasized 
that the open communication systems in the 
present study are presumed to be psycho- 
logically determined rather than mechanically 
determined as was the case in the earlier 
studies. 

With reference to the specific independent
variables, the main effects of leader F-scale
scores were not significant with regard to
group confidence. However, when leader F-
scale scores and group attraction indices were 
analyzed conjointly, interaction effects were 
significant (see Table 7). The greatest con- 
fidence was expressed in attractive groups, the 
leaders of which had high F-scale scores; 
while the least confidence was expressed in low 
attraction groups, the leaders of which had 
high F-scale scores. If high group attraction 
is interpreted as indicating homogeneity of the 
members’ F-scale scores, the results are in 
agreement with those presented in an earlier 
study relating homogeneity of the members’ 
F-scale scores to group morale (4). 

Contrary to a number of previous studies 
(1, 8, 12) the results reported above suggest 
a positive aspect of “authoritarianism.” Pre- 
sumably, the positive or negative aspects of 
authoritarianism with regard to the leader 
emerge only in conjunction with other leader 
and group characteristics and under particular 
task demands. 


Summary and Conclusions 


This study was designed to explore the re- 
lationship between selected group structure 
variables and the group’s ability to adjust 
to the requirements of a new situation (group 
flexibility) and the group members’ expressed 
confidence in the ability of the group to suc- 
ceed in a problem-solving situation. The in- 
dependent variables selected for analysis in- 
cluded the F-scale scores and “conformity” 
of the assigned leader and group attraction. 

The Ss were 96 aircrews comprising about 
1,000 men. The group task required the com- 
pletion of an eight-item intelligence examina- 
tion in a period of time which demanded a 
change in the group’s customary operating 
procedure. 

In the more flexible groups the leaders 
scored low on the F-scale and moderately high 
on a scale of conformity. It was also found 
that greater confidence in the group was ex- 
pressed by members of high attraction groups 
and groups whose leaders tended to conform 
to the group members’ opinions. In addition,
interaction effects of leader F-scale scores and
group attraction with regard to group confidence
were statistically significant. It was
concluded that groups with more open communication
systems (groups with fewer communication
restraints) are more flexible and
more confident.


Received January 27, 1958. 


References 


1. Adorno, T. W., Frenkel-Brunswik, Else, Levinson, 
D. J., & Sanford, R. N. The authoritarian 
personality. New York: Harpers, 1950. 

2. Festinger, L., Schachter, S., & Back, K. Social
pressures in informal groups: A study of a 
housing project. New York: Harpers, 1950. 

3. Hartley, E. L., & Hartley, R. E. Fundamentals 
of social psychology. New York: Knopf, 1952. 

4. Haythorn, W. The effects of varying combina- 
tions of authoritarian and equalitarian leaders 
and followers. J. abnorm. soc. Psychol., 1956, 
53, 210-219. 

5. Hemphill, J. K. Situation factors in leadership.
Ohio State Univer.: Bureau of Educational 
Research Monographs, 32, 1950. 

6. Kelley, H. H., & Shapiro, M. M. An experiment 
on conformity to group norms when con- 
formity is detrimental to group achievement. 
Amer. soc. Rev., 1954, 19, 667-677. 

7. Kelley, H. H., & Thibaut, J. W. Experimental 
studies of group problem-solving and process. 
In G. Lindzey (Ed.), Handbook of social 
psychology. Cambridge, Mass.: Addison-Wes- 
ley, 1954. 

8. Rokeach, M. Generalized mental rigidity as a 
factor in ethnocentrism. J. abnorm. soc. 
Psychol., 1948, 43, 259-278. 

9. Shaw, M. E. A comparison of two types of 
leadership in various communication nets. J. 
abnorm. soc. Psychol., 1955, 50, 127-134. 

10. Shaw, M. E., & Rothschild, G. H. Some effects 
of prolonged experience in communication 
nets. J. appl. Psychol., 1956, 40, 281-286. 

11. Snedecor, G. W. Statistical methods. (4th ed.) 
Ames, Iowa: Iowa State College Press, 1946. 

12. Stern, G. G., Bloom, B. S., & Stein, M. I. As- 
sessment of personality II: Studies of the re- 
lationship between personality syndromes, 
learning situations, and learning outcomes. 
Chicago: Univer. Chicago, 1952. 

13. Torrance, E. P. Crew performance in a test 
situation as a predictor of field and combat 
effectiveness. Washington, D. C.: Human 
Factors Operations Research Laboratories, 
1953. (HFORL Report No. 33.) 










Journal of Applied Psychology 
Vol. 42, No. 5, 1958 


The Comparative Effectiveness of Some Psychological and Physiological 
Measures in Ranking the Impact of Diverse Environmental 
Conditions ¹


Bernard J. Fine 


QM Research and Engineering Center Laboratories, Natick, Massachusetts 


In the course of evaluating the thermal pro- 
tection qualities of military clothing, it is fre- 
quently necessary to measure differences in 
the responses of soldiers along several physio- 
logical dimensions. These measurements are 
generally taken, of necessity, under controlled 
conditions, and are not readily adaptable to 
field use. Furthermore, they are both time 
consuming and expensive since they involve 
the use of fairly elaborate equipment both in 
obtaining the measures and in processing the 
data. 

The present study investigates, in two parts, 
the use of subjective rating scales as sub- 
stitutes for the more complex physiological 
measurements. 

The subjective aspects of man’s thermal en- 
vironment have had little systematic study. 
Most of the work that has been done is psy- 
chophysical in nature and involves, primarily, 
the determination of thresholds of temperature 
sensation (4, 6, 7, 8). Several studies have 
investigated the relationships between person- 
ality variables and subjective responses to 
various environmental factors (1, 3, 10, 11). 
There have also been some attempts to get at 
the psychological and physiological bases of 
subjective reports of warmth or coldness (1, 
2, 5, 11). Insofar as is known, there have 
been no studies comparing psychological with 
physiological measurements in their effective- 
ness in ranking a number of different en- 
vironments according to the thermal state of 
individuals in those environments. Such a 
comparison is reported in the research de- 
scribed below. 


1 The author wishes to thank P. Iampietro for his 
cooperation in making the physiological data avail- 
able and for his helpfulness with regard to the physio- 
logical aspects of the study. 


Study Number One 
Method 


Subjects. The subjects (Ss) were six enlisted men 
stationed at the Quartermaster Research & Engineer- 
ing Command as volunteer test subjects. They are 
described in greater detail in a paper by Iampietro 
et al. (9). They were considered to be “normal” on 
the basis of psychological and physiological measure- 
ments. 

Procedure. The six Ss, clad only in shorts, were 
exposed as a group to different environmental condi- 
tions on eight successive days (excluding one week- 
end). The conditions represented combinations of 
ambient temperatures of 50 and 60 degrees F. with 
humidities of 30 and 95 per cent and wind speeds of 
zero and 10 miles per hour. The order in which the 
conditions were presented was randomly determined.
Each exposure lasted two hours each day. A one- 
hour “control” period preceded each exposure and a 
one-half hour “recovery” period followed each ex- 
posure. Physiological and psychological measures 
were taken during the control, exposure, and recov- 
ery periods. 

The physiological measures which are considered 
in this report are mean weighted skin temperature 
(MWST) and increase in metabolic rate (ΔMR).
These are discussed fully in the paper by Iampietro 
et al. (9) as is the methodology in general. Briefly, 
MWST represents an average skin temperature based 
on readings obtained from various parts of the body. 
Metabolic rate increase, as used herein, refers to the 
average increase in metabolism (cal./hr.), during ex- 
posure, over the reading obtained in the control 
period, which is an approximate basal reading. The 
average increase in MR is based on four MR meas- 
ures taken during the exposure period. 
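
As a minimal sketch of the metabolic rate increase defined above, the value for one S on one day is the mean of the four exposure readings minus the control-period reading. The readings below are hypothetical values, not data from the study, and the function name `delta_mr` is illustrative.

```python
from statistics import mean


def delta_mr(control_mr, exposure_mrs):
    """Average increase in metabolic rate (cal./hr.) over the control reading."""
    if len(exposure_mrs) != 4:
        raise ValueError("expected four exposure-period readings")
    return mean(exposure_mrs) - control_mr


# Hypothetical readings in cal./hr. for one subject on one day.
control = 80.0
exposure = [120.0, 150.0, 170.0, 160.0]

print(delta_mr(control, exposure))  # 70.0
```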

The psychological measure consisted of a paper- 
and-pencil subjective rating scale on which the S 
indicated how various parts of his body (head, 
chest, back, arms, hands, legs, and feet) felt on a six- 
point scale ranging from “very hot” to “very cold.” 
Each S indicated how the body parts felt at half- 
hour intervals during the control, exposure and re- 
covery periods. Only the four exposure ratings are 
considered in this report. 


Results 


Agreement between subjects—skin temperature.
A MWST reading of each S was obtained
at the end of each two-hour exposure
period. The eight conditions were ranked
from warmest to coldest for each S on the
basis of his MWST’s. A Kendall coefficient
of concordance (12) calculated from the rankings
for the six Ss yielded a W of .91 which
is significant at better than the .01 level.
This indicates that there is a high degree of
agreement between the six sets of rankings.
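
The coefficient of concordance can be sketched as follows, using the usual formula for untied rankings, W = 12S / (m²(n³ − n)), where m is the number of Ss, n the number of conditions, and S the sum of squared deviations of the rank sums from their mean. The individual rankings are not reported in the paper, so the six rankings below are hypothetical and serve only to show the computation.

```python
def kendalls_w(rankings):
    """Kendall coefficient of concordance for m judges ranking n items (no ties)."""
    m = len(rankings)       # number of subjects doing the ranking
    n = len(rankings[0])    # number of conditions ranked
    rank_sums = [sum(r[j] for r in rankings) for j in range(n)]
    mean_sum = m * (n + 1) / 2
    s = sum((rs - mean_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical rankings of eight conditions by six subjects (1 = warmest).
rankings = [
    [3, 1, 7, 2, 6, 4, 8, 5],
    [3, 1, 8, 2, 6, 4, 7, 5],
    [4, 1, 7, 2, 5, 3, 8, 6],
    [3, 2, 7, 1, 6, 4, 8, 5],
    [2, 1, 7, 3, 6, 5, 8, 4],
    [3, 1, 8, 2, 5, 4, 7, 6],
]

print(round(kendalls_w(rankings), 2))
```

W ranges from 0 (no agreement among the rankings) to 1 (identical rankings), which is why values such as those reported here are read as high agreement.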

Agreement between subjects—metabolic
rate increase. An average ΔMR for each S
during each two-hour exposure period was
obtained. The eight conditions were ranked
from warmest to coldest for each S on the
basis of the ΔMRs. A coefficient of concordance
calculated from the six sets of rankings
yielded a W of .83, which is significant at
better than the .01 level, indicating a high
amount of agreement between the six sets of
rankings based on MR.

Agreement between subjects—subjective 
rating scale. Numbers were assigned to the 
six points on the subjective rating scale, and 
each S’s ratings for all body parts for the four 
exposure administrations of the scale were 
averaged. An average subjective response for 
each S for each experimental condition was 
thus obtained. On the basis of these aver- 
ages, it was possible to rank the eight condi- 
tions for each S in terms of how warm or cold 
they made him feel. The coefficient of con- 
cordance calculated from the six sets of rank- 
ings yielded a W of .76 which is significant at 
better than the .01 level and indicates a high 
level of agreement between the six sets of 
rankings based on subjective ratings. 
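
The averaging step can be sketched as follows: one S's score for a condition is the mean of his ratings over the seven body parts and the four exposure administrations. The ratings below are hypothetical, and the numeric coding (1 = "very cold" through 6 = "very hot") is an assumption, chosen because in Table 1 the warmest ranks go with the highest averages.

```python
from statistics import mean

BODY_PARTS = ["head", "chest", "back", "arms", "hands", "legs", "feet"]


def condition_score(administrations):
    """Average rating over all body parts and all exposure administrations."""
    ratings = [adm[part] for adm in administrations for part in BODY_PARTS]
    return mean(ratings)


# Hypothetical ratings for one S under one condition, taken at half-hour
# intervals during exposure. Assumed coding: 1 = "very cold", 6 = "very hot".
administrations = [
    {"head": 3, "chest": 3, "back": 3, "arms": 2, "hands": 2, "legs": 2, "feet": 2},
    {"head": 3, "chest": 2, "back": 2, "arms": 2, "hands": 2, "legs": 2, "feet": 1},
    {"head": 2, "chest": 2, "back": 2, "arms": 2, "hands": 1, "legs": 2, "feet": 1},
    {"head": 2, "chest": 2, "back": 2, "arms": 1, "hands": 1, "legs": 2, "feet": 1},
]

score = condition_score(administrations)
print(round(score, 2))
```

Repeating this for each of the eight conditions gives the eight averages on which that S's ranking of the conditions is based.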

Comparison of subjective and physiologi- 
cal measures. Having shown that there is a 
high degree of agreement between Ss in rank- 
ing the eight conditions according to their 
subjective and physiological responses, it was 
then permissible to determine the extent of 
agreement between the subjective and physio- 
logical methods of ranking the conditions. 

To do this, the individual responses were
averaged so that a mean group MWST, ΔMR,
and subjective rating was obtained for each
of the eight conditions. The eight conditions


2 The numbers were assigned to the words strictly
as a convenience in ordering the data. No assumption
was made regarding the equality of the intervals
of the subjective scale.




Table 1

Comparison of Rankings of Conditions by MWST,
ΔMR, and Subjective Methodsᵃ

Conditionsᵇ ᶜ    MWST Method    ΔMR Method    Subjective Method

50/30/0          3 (79.8)       5 (85)        5 (1.83)
60/95/0          1 (83.8)       1 (18)        1 (2.73)
50/30/10         7 (69.8)       8 (171)       8 (1.00)
60/30/0          2 (81.3)       2 (38)        2 (2.62)
60/95/10         6 (74.0)       6 (89)        6 (1.73)
50/95/0          4 (79.4)       3 (39)        3 (2.10)
50/95/10         8 (69.4)       7 (119)       7 (1.62)
60/30/10         5 (74.8)       4 (74)        4 (1.85)

ᵃ The numbers in parentheses refer to the average upon
which the ranking is based. MWST averages are in degrees F.;
ΔMR averages are in cal./hr.; and subjective averages are
based on the 1 through 6 ratings on the subjective scale.

ᵇ In a condition designation, e.g., 50/30/0, the first figures
refer to ambient temperature, the second to percentage of
humidity and the third to wind speed.

ᶜ Conditions are listed in the order in which they were presented.


were then ranked from “warmest” to “coldest”
according to the average measurements.
Thus, three sets of rankings of the conditions
were obtained: one based on MWST, one on
ΔMR, and one on subjective ratings. It was
then possible, using the Spearman rank correlation
technique (12), to calculate the degree
of agreement between the subjective and
physiological methods of ranking the conditions.
The group averages and rankings for
the MWST, ΔMR, and subjective rating
methods are shown in Table 1. The rank
order correlation between the subjective ratings
and MWST yielded an r_s of .95 and that
between subjective ratings and ΔMR an
r_s of 1.00. These correlations indicate a high
degree of agreement between the subjective
rating scale and each of the physiological
measures in ranking the conditions from
warmest to coldest.
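
The ranking and correlation steps can be sketched as follows. The group averages are taken from Table 1 as it appears in this copy, and the pairing of each average with a condition assumes that every column lists the conditions in the same order. The helper names are illustrative. Because the inputs are rounded averages read from the printed table, the printed coefficients need not be reproduced exactly.

```python
def ranks(values, warmest_is_high):
    """Rank conditions from warmest (1) to coldest; assumes no tied values."""
    order = sorted(values, reverse=warmest_is_high)
    return [order.index(v) + 1 for v in values]


def spearman_rs(x, y):
    """Spearman rank correlation for two untied rankings of equal length."""
    n = len(x)
    d2 = sum((a - b) ** 2 for a, b in zip(x, y))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))


# Group averages from Table 1, in the order the conditions are listed.
mwst = [79.8, 83.8, 69.8, 81.3, 74.0, 79.4, 69.4, 74.8]        # degrees F.
delta_mr = [85, 18, 171, 38, 89, 39, 119, 74]                  # cal./hr.
subjective = [1.83, 2.73, 1.00, 2.62, 1.73, 2.10, 1.62, 1.85]  # 1-6 scale

mwst_ranks = ranks(mwst, warmest_is_high=True)        # warmer skin = warmer condition
mr_ranks = ranks(delta_mr, warmest_is_high=False)     # smaller increase = warmer
subj_ranks = ranks(subjective, warmest_is_high=True)  # higher rating = warmer

print(subj_ranks)
print(spearman_rs(subj_ranks, mr_ranks))
print(spearman_rs(subj_ranks, mwst_ranks))
```

When two methods order the conditions identically, every rank difference is zero and the coefficient is exactly 1, which is the case for the subjective and metabolic rate rankings in Table 1.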


Study Number Two 
Method 


Subjects. The Ss were six enlisted men stationed at 
the Quartermaster Research & Engineering Command 
as volunteer test subjects. One S had participated 
in Study Number One. 

Procedure. The procedure was exactly the same 
as described in Study Number One except that the 
men were clothed in the standard Army fatigue 
uniform and the ambient temperatures were 40 and 
50 degrees F. 







Table 2

Comparison of Rankings of Conditions by MWST,
ΔMR, and Subjective Methods

Conditions    MWST Method    ΔMR Method    Subjective Method

50/30/10      5 (75.6)       6 (81)        6 (2.25)
40/95/10      7 (72.0)       8 (101)       8 (1.80)
40/95/0       3 (79.5)       3 (51)        3 (2.83)
50/95/10      6 (74.9)       5 (79)        5 (2.50)
40/30/0       4 (78.8)       4 (59)        4 (2.65)
50/95/0       1 (81.7)       1 (27)        1 (3.08)
50/30/0       2 (80.8)       2 (34)        2 (2.87)
40/30/10      8 (71.4)       7 (93)        7 (2.05)


Results 


Agreement between subjects. As in Study
Number One, coefficients of concordance were
calculated from the rankings based on MWST,
ΔMR and subjective ratings. These were
.86, .75, and .85, respectively. All three are
significant at better than the .01 level and
indicate a high amount of agreement between
the six sets of rankings with respect to MWST,
ΔMR, and subjective ratings.

Comparison of subjective and physiological
measures. The data were treated the same
way as in Study Number One. The comparisons
between the MWST, ΔMR, and subjective
rating methods are shown in Table 2.
The rank order correlation between subjective
ratings and MWST yielded an r_s of .93 and
that between subjective ratings and ΔMR an
r_s of 1.00. As in Study Number One, the correlations
indicate a high degree of agreement
between the subjective rating scale and each
of the physiological measures in ranking the
conditions from warmest to coldest.


Discussion 


The results of both studies indicate that if 
it is necessary to compare environmental con- 
ditions and/or order them according to the 
warmth or coldness of groups exposed to them, 
the subjective rating scale method is as ef- 
ficient as either of the two physiological 
methods. The subjective method is also 
simpler to administer and analyze for scoring. 




It should be noted that when actual physio- 
logical values are required, the subjective 
scales are relatively inaccurate in their ap- 
proximations and cannot replace the physio- 
logical measurements. 

On the other hand, when an assessment of 
Ss’ thermal “feelings” is desired, for other 
than comparative purposes, the appropriate 
measure is the subjective one. One cannot 
accurately infer actual subjective comfort 
from physiological measurements. 

While the two studies did not investigate 
the comparison of physiological and psycho- 
logical responses of groups wearing different 
clothing in a constant environment, the re- 
sults indicate that the subjective method 
should be as efficient as the physiological 
methods in such instances for comparative 
purposes so long as MWST’s are not expected 
to drop below approximately 65 degrees F. 
There is some evidence that below certain 
MWST’s, the subjective scale loses some of its 
accuracy. The precise point at which the 
subjective scale used herein becomes inac- 
curate has not yet been established. 


Summary 


Two studies are presented, both of which 
compare the effectiveness of a subjective rat- 
ing scale with two physiological measures, 
mean weighted skin temperature and average 
increase in metabolic rate, in ranking eight 
environmental conditions (varying in ambient 
temperature, humidity and wind speed) from 
warmest to coldest. 

The results of both studies indicate a high 
degree of consistency between individual re- 
sponses on each measure and a high degree 
of agreement between the subjective rating 
method and both mean weighted skin temperature
and metabolic rate methods of ranking
the conditions.

It is noted that the subjective scale might 
be used instead of the physiological measure- 
ments for the purpose of comparing condi- 
tions with regard to the relative warmth or 
coldness of groups of individuals within the 
conditions, but only for that purpose. 
Received January 29, 1958. 


3 John McGinnis, personal communication, 1957.










References 


1. Blair, J. R., Urbush, F. W., & Reed, I. T. Pre- 
liminary observations on physiological, nutri- 
tional, and psychological problems in extreme 
cold. Fort Churchill, Canada. Proj. No. 57-3, 
Field Res. Lab., Ft. Knox, Ky., 1947. 

2. Chagas, C., Mortara, G., & Borges Sampaio, F. 
Alguns estados sobre e sensibilidade termica. 
An. Acad. Brazil Cienc., 1947, 19(1), 71-102. 

3. Debons, A. Survey of human adjustment prob- 
lems in the northern latitudes. Personality 
predispositions of infantrymen as related to 
their motivation to endure tour in Alaska. A 
comparative evaluation. Arctic Aeromed. Lab., 
Ladd AFB, Alaska, 1950. (Proj. Rep. No. 
21-01-022.) 

4. Ebaugh, F. G., Jr., & Thauer, R. Influence of 
various environmental stimuli on cold and 
warmth thresholds. J. appl. Physiol., 1950, 
3, 173. 

5. Freedman, A., & Horvath, S. M. Cold weather 
operations. Test of the adequacy and range 
of use of winter clothing. Study of methods 
for selection of men for cold weather opera- 
tions. Proj. No. 1-1: 1-18, Armored Med. 
Res. Lab., Fort Knox, Ky., 1944. 


6. Hardy, J. D., Goodell, H., & Wolff, H. G. Influence
of skin temperature upon pain threshold
as evoked by thermal radiation. Science,
1951, 114, 149-150.

7. Hendler, E., & Hardy, J. D. Temperature sensations
accompanying changes in skin temperature.
NAMC-ACEL-350, Proj. No. NM
1701132, Naval Air Mat. Ctr., Phila., 1957.

8. Herget, C. M., Granath, L. P., & Hardy, J. D.
Thermal sensation and discrimination in relation
to intensity of stimulus. Am. J. Physiol.,
1941, 134, 645.

9. Iampietro, P. F., Bass, D. E., & Buskirk, E. R.
Heat exchanges of nude men in the cold:
Effect of humidity, temperature and wind
speed. J. appl. Physiol., in press.

10. McCollum, E. L. Personality alteration during
reduced caloric intake under survival conditions
in the sub-Arctic. Arctic Aeromed. Lab.,
Ladd AFB, Alaska, 1950. (Proj. Rep. No.
21-01-025.)

11. Patrick, J. R., & Rowles, E. Intercorrelations
between metabolic rate, vital capacity, blood
pressure, intelligence, scholarship, personality
and other measures on university women. J.
appl. Psychol., 1933, 17, 507-521.

12. Siegel, S. Nonparametric statistics. New York:
McGraw-Hill, 1956.










What happens (or should happen) when a supervisor 


interviews an employee regarding his work ...?—READ... 


THE APPRAISAL INTERVIEW 


By NORMAN R. F. MAIER, Professor of Psychology, University of Michigan 


This new book offers a penetrating
analysis of the cause and effect relations 
that exist in an interview situation de- 
signed to appraise employee efficiency. 


The author thoroughly explores the 
styles of appraisal interviews, each of 
which requires different skills and is de- 
signed to meet different objectives. The 
first two—the “Tell and Sell” and the 
“Tell and Listen” methods—deal with em- 
ployee development. The third method— 
“Problem-Solving”—is concerned with job 
improvement. 


The Appraisal Interview describes the
problems faced by the interviewer, the 
skills needed to make the interview suc- 
cessful by overcoming these problems, and 
the importance of clarifying the objectives 
of the interview in order to reach a productive
conclusion. The dangers involved
are pointed out, showing how an interview, 
if improperly handled, can do more harm 
than good. The book shows why inter- 
views go wrong and what can be done to 
save them. Actual transcriptions of inter- 
views, conducted by different methods, 
have been included to illustrate specific 
problems. 1958. 246 pages. $5.95. 


Send for an examination copy. 













PSYCHOLOGICAL 
MONOGRAPHS 





A sample of monographs still in print: 


Factors Related to Success in the Human Relations Aspect of Work-Group
Leadership, by H. H. Meyer. No. 320. $1.00


Occupational Aptitude Patterns of Selected Groups of Counseled Veterans, by
W. L. Barnette, Jr. No. 322. $1.50


The Academic Achievement of Veteran and Non-Veteran Students, by Norman
Frederiksen and W. B. Schrader. No. 347. $1.50


Errors in Time-Study Judgments of Industrial Work Pace, by K. A. Lifson.
No. 355. $1.00


The Prediction of Effectiveness as a Factory Worker, by M. M. Bruce. No.
362. $1.00


Job Satisfaction as Related to Need Satisfaction in Work, by R. H. Schaffer.
No. 364. $1.00


Improving the Employability and Attitudes of “Difficult to Place” Persons, by
Alfred Feintuch. No. 392. $1.00





Order now before these monographs go out of print. 
Write for complete list of items still available. 


American Psychological Association 
1333 Sixteenth Street, N. W. 
Washington 6, D. C. 











